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Sir: 

This Appeal Brief is presented in support of the Notice of Appeal to the Board of 
Patent Appeals and Interferences, mailed on May 3, 2005, from the Final Rejection of 
claims 1, 2, 10, 14, 16-18, and 21 of the above-identified Application, as set forth in the 
Final Office Action mailed on November 3, 2004, and the Advisory Action mailed on 
February 28, 2005. 

The Commissioner of Patents and Trademarks is hereby authorized to charge 
Deposit Account No. 19-0743 in the amount of 500.00 which rq)resents the requisite fee 
set forth in 37 C.F.R. § 41 .2(b)(2). The Appellant respectfully requests consideration and 
reversal of the Examiner's rejections of the pending claims. 
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1. REAL PARTY IN INTEREST 

The real party in interest of the above-captioned patent application is the assignee, 
INTEL CORPORATION. 
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2. RELATED APPEALS AND INTERFERENCES 

There are no other appeals, interferences, or judicial proceedings known to 
Appellant that will have a bearing on the Board's decision in the present appeal 
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3. STATUS OF THE CLAIMS 

The present application was filed on July 2, 1999 with claims 1 - 28. After a first 
Non-Final Office Action and Final Office Action were received, and timely responses 
filed thereto, a Continued Prosecution Application was filed on April 24, 2003. After 
four fiirther Non-Final Office Actions and a second Final Office Action (hereinafter "the 
Final Office Action") were received, and timely responses filed thereto, an Advisory 
Action was mailed on February 28, 2005. 

At this time, claims 1-28 are currently pending in the Application. Claims 24-28 
have been allowed. Objections have been raised with respect to claims 3-9, 1 1-13, 15, 
19, 20, 22, and 23 as being dependent on rejected base claims. Claims 1-2, 10, 14, 16-18, 
and 21 stand rejected, and their rejection is appealed herein. 
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4. STATUS OF AMENDMENTS 

No request for amendment has been made subsequent to that of claim 10 in 
conjunction with a response to the Final Office Action, mailed January 3, 2005. In the 
Advisory Action mailed on February 28, 2005, no indication was made as to whether this 
amendment has been entered. Since the amendment was made solely to provide clarity 
and consistency, and no new matter or other issues arise, it is assumed by the Appellant 
that the amendment has in fact been entered. For the convenience of the Board, the 
amendment is reproduced below: 

10. (Currently Amended) A method of preparing a circuit model for simulation, the 
circuit model having a model size, and th e m e thod comprising: 

merging a plurality of extended latch boundary components into a plurality of 
partitions having a partition size; and 

maintaining a load balance within the plurality of partitions. 
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5. SUMMARY OF CLAIMED SUBJECT MATTER 



This summary is presented in compliance with the requirements of Title 37 C.F.R. 
§ 41.37(c)(l)(v), mandating a "concise explanation of the subject matter defined in each 
of the independent claims involved in the appeal . . Nothing contained in this summary 
is intended to change the specific language of the claims described, nor is the language of 
this summary to be construed so as to limit the scope of the claims in any way. 

Some embodiments of the invention are related to a method of preparing a circuit 
model for simulation. The method may include decomposing the circuit model (having a 
number of latches) into a plurality of extended latch boundary components, and 
partitioning the pliu-ality of extended latch boundary components. (Application, 
Independent claim 1; FIG. 2; pg. 2, lines 6-8 and pg. 3, line 20 - pg. 4, line 8). The 
method may also include merging a plurality of extended latch boundary components into 
a plurality of partitions having a partition size and maintaining a load balance within the 
plurality of partitions. (Application, Independent claim 10; FIG. 5; and pg. 6, line 27 - 
pg. 7, line 9). The method may also include grouping a plurality of extended latch 
boundary components into a plurality of partitions and reducing the communication time 
within the plurality of partitions by adjusting the grouping. (Application, Independent 
claim 14; FIG. 6; and pg. 7, lines 10-24). 

Some embodiments of the invention are related to a method of forming an 
extended latch boundary component. The method may include selecting a path having a 
first node selected firom a group consisting of latches and primary outputs and a second 
node selected fi-om a group consisting of latches and primary inputs, wherein the path can 
include a latch between the first node and the second node. (Application, Independent 
claim 16; FIG. 1; and pg. 3, lines 5-19.) 

Some embodiments of the invention are related to a latch boundary component 
including a path comprising a plurality of first nodes selected fi-om a group consisting of 
latches and primary outputs and a plurality of second nodes selected fi-om a group 
consisting of latches and primary inputs, where the path can include a plurality of latches 
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between the plurality of first nodes and the plurality of second nodes. (Application, 
Independent claim 17; FIG. 1; and pg. 3, lines 5-19.) 

Some embodiments of the invention are related to a method of sharing a repeated 
circuit structure in a circuit model. The method may include expanding the repeated 
circuit structure once to form an expanded circuit structure and grafting the expanded 
circuit structure to the circuit model as needed. (Application, Independent claim 18; FIG. 
7; and pg. 7, line 25 - pg. 8, line 5). 

Some embodiments of the invention are related to a method of simulating a circuit 
model. The method may include partitioning a plurality of extended latch boundary 
components to form a plurality of partitions having a size, preparing a pliu-ality of 
simulations fi-om the plurality of partitions, and executing the plurality of simulations on 
a processing unit. (Application, Independent claim 21; FIG. 9; and pg. 8, line 14 - pg. 9, 
line 2). 
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6. GROUNDS OF REJECTION TO BE REVIEWED ON APPEAL 

6.1 Claim 18 stands rejected under 35 USC § 102(b) as being anticipated by Beausang 
et al. (U.S. 5,828,579; hereinafter "Beausang-3"). [Note: the references are 
numbered to coincide with the designations assigned by the Examiner in the Final 
Office Action.] 

6.2 Claims 10, 16 and 17 stand rejected under 35 USC § 102(b) as being anticipated by 
Beausang et al. (U.S. 5,903,466; hereinafter "Beausang- 1"). 

6.3 Claim 14 stands rejected under 35 USC § 103(a) as being unpatentable over 
Beausang-1 in view of Beausang-3. 

6.4 Claims 1-2 and 21 stand rejected under 35 USC § 103(a) as being unpatentable over 
Beausang-1 in view of Beausang et al. (U.S. 5,949,692; hereinafter "Beausang-2"). 
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7. ARGUMENT 

7.1 The Applicable Law 

Anticipation under 35 USC § 102 requires the disclosure in a single prior art 
reference of each element of the claim under consideration. See Verdegaal Bros, K 
Union Oil Co, of California, 814 F.2d 628, 631, 2 USPQ 2d 1051, 1053 (Fed. Cir. 1987). 
It is not enough, however, that the prior art reference discloses all the claimed elements in 
isolation. Rather, "[ajnticipation requires the presence in a single prior reference 
disclosure of each and every element of the claimed invention, arranged as in the claim'' 
Lindemann Maschinenfabrik GmbHv, American Hoist & Derrick Co,, 730 F.2d 1452, 
221 USPQ 481, 485 (Fed. Cir. 1984) (citing Connell v. Sears, Roebuck & Co,, 111 F.2d 
1542, 220 USPQ 193 (Fed. Cir. 1983)) (emphasis added). "The identical invention must 
be shovra in as complete detail as is contained in the ... claim." Richardson v. Suzuki 
Motor Co,, 868 F.2d 1226, 1236, 9 USPQ2d 1913, 1920 (Fed. Cir. 1989); MPEP § 2131 
(emphasis added). 

The Examiner has the burden under 35 U.S.C. § 103 to establish a prima facie 

case of obviousness. In refine, 837 F.2d 1071, 1074, 5 U.S.P.Q.2d (BNA) 1596, 1598 

(Fed. Cir. 1988). The M.P.E.P. contains explicit direction to the Examiner that agrees 

with the In re Fine court: 

In order for the Examiner to establish a prima facie case of obviousness, 
three base criteria must be met. First, there must be some suggestion or 
motivation, either in the references themselves or in the knowledge 
generally available to one of ordinary skill in the art, to modify the 
reference or to combine reference teachings. Second, there must be a 
reasonable expectation of success. Finally, the prior art reference (or 
references when combined) must teach or suggest all the claim limitations. 
The teaching or suggestion to make the claimed combination and the 
reasonable expectation of success must both be found in the prior art, and 
not based on applicant's disclosure. M.P,E,P. § 2142 (citing In re Vaeck , 
947 F.2d 488, 20 U.S.P.Q.2d (BNA) 1438 (Fed. Cir. 1991)). 

The requirement of a suggestion or motivation to combine references in a prima facie 
case of obviousness is emphasized in the Federal Circuit opinion. In re Sang Su Lee, 111 
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F.3d 1338; 61 U.S.P.Q.2D 1430 (Fed. Cir. 2002), which indicates that the motivation 
must be supported by evidence in the record. 

The test for obviousness under § 103 must take into consideration the invention as 
a v^hole; that is, one must consider the particular problem solved by the combination of 
elements that define the invention. Interconnect Planning Corp, v. Feil, 11 A F.2d 1 132, 
1 143, 227 U.S.P.Q. 543, 551 (Fed. Cir. 1985). References must be considered in their 
entirety, including parts that teach away from the claims. See MPEP § 2141.02. 



7.2 The References 

Beausang'l: discloses a scan insertion process for complex circuit synthesis that 
has a reduced set of constraint-driven compiler optimizations. See Beausang-1, Col. 4, 
lines 46-48. A three-tiered performance optimization is used within the insertion process: 
the first tier optimizes the design for test elements; the second tier optimizes all elements 
in the design (in addition to implementing the first tier); and the third tier performs 
sequential optimization, as well as local optimization (in addition to implementing the 
second tier). See Beausang-1, Col. 4, lines 50-59. 

Beausang-2: describes a synthesis process of scan insertion/replacement and 
routing. See Beausang-2, Col. 31, lines 48-56. Scan resources may be inserted into an 
integrated circuit design organized into hierarchical modules. See Beausang-2, Col. 3, 
lines 52-54. 

Beausang-3: is directed toward a scan chain design database that defines a 
hierarchical circuit design as anetlist of logic cells, some of which contain scan structiu-e. 
See Beausang-3, Col. 6, lines 47-66. Inferred scan structures may be recognized and 
replaced with explicit scan structures. See Beausang-3, Col. 7, lines 7-28. 

7.5 Discussion of the Rejections 

7.3.1 The Rejections Under S 102: 

Claim 18 was rejected under 35 USC § 102(b) as being anticipated by Beausang- 
3. Claims 10, 16 and 17 were rejected under 35 USC § 102(b) as being anticipated by 
Beausang-1. The Appellant does not admit that Beausang-1 or Beausang-3 are prior art, 
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and reserves the right to swear behind these references at a later date. In addition, 
because the Appellant asserts that the Office has not shown that Beausang-1 or 
Beausang-3 disclose the identical invention as claimed, the Appellant respectfully 
traverses these rejections of the claims. 

With respect to claim 18, it is noted that Beausang-3 is directed toward a scan 
chain design database that defines a hierarchical circuit design as a netlist of logic cells, 
some of which contain scan structure. See Beausang-3, Col. 6, lines 47-66. While the 
assertion is made in the Final Office Action that Beausang-3 Figs. 6A, 6B, and 7B 
illustrate various elements of the claim, it is respectfully noted that the cited elements are 
actually shown as part of creating balanced scan chains, and not sharing a repeated circuit 
structure in a circuit model, as claimed by the Appellant. 

FIGs. 6A and 6B of Beausang-3 show before and after snapshots of synthesis 
results for two different chains having a single clock domain, while FIGs. 7A and 7B 
show before and after snapshots of synthesis results for two different chains having a 
mixed clock edge design. See Beausang-3, Col. 33, lines 5-50. The Appellant was 
unable to locate the activities of "expanding the repeated circuit structure" (there is no 
repeated structure shown) and "grafting the expanded structure" (since there is no 
expanded, repeating structure) within the bounds of Beausang-3, as claimed by the 
Appellant. 

The Office attempts to explain similarities between Beausang-3 and the claimed 
embodiment by asserting that the insertion of Beausang-3 's latch 713 into level 712 (see 
Beasang-3, FIGs. lOA-lOB) represents "expansion of a repeated circuit structure." A 
similar assertion is made with respect to the activity represented by block 335 of 
Beausang-3 's FIG. 2 A (where an unused chain is added to the scan plan - the imused 
chain being user-defined and having no scan structure associated with it). See Beausang- 
3, Col. 27, lines 15-23. However, neither of these assertions support the limitation of 
"expanding the repeated circuit structure once to form an expanded circuit structure" 
claimed by the Appellant, since no method of sharing a repeated circuit structure is 
shown. 
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As to claim 10, it appears that an element claimed by the Appellant (i.e., "merging 
a plurality of extended latch boimdary components into a plurality of partitions") was not 
addressed in the Final Office Action, and the Appellant was unable to find any indication 
of their existence in Beausang-1. The Office attempts to explain that the mere presence 
of equal numbers of latch elements 307 and 321 in Beausang-1, FIG. 3B equates to 
"maintaining a load balance within the plurality of partitions," as claimed by the 
Appellant. However, this is not the case. As noted in Beausang-1, "a scan replacement 
process . . . replaces the non-scan memory cells 307a-e of imit 301 with scannable 
memory cells 320a-e (FIG. 3B) . . Thus, there is no load balance maintenance 
whatsoever; the process described by Beausang-1 is one of replacing one set of elements 
with another, and not balancing between partitions. See Beausang-1, Col. 9, lines 40-43. 

As to claim 16, it is respectfiilly noted that there is no indication that the circuit 
combination shown in FIG. 13B of Beausang-1 is a result of the claimed process (i.e., a 
method of forming an extended latch boundary component comprising "selecting a path 
having a first node selected fi:-om a group consisting of latches and primary outputs and a 
second node selected fi'om a group consisting of latches and primary inputs"). 

A question regarding what may comprise a "primary input" or "primary output" 

was raised in the Final Office Action. These terms are well-understood by those of skill 

in the art of circuit synthesis. Numerous references make use of these terms. For 

example, as explained in Wireplanning in Logic Synthesis: 

"A logic circuit L is a 3 -tuple (I, O, F). I is a set of primary input pins, or 
simply primary inputs. O is a set of primary output pins, or simply 
primary outputs. Each element of I an O is a binary variable." 
Wireplanning in Logic Synthesis, ICCAD98, pg. 26 (attached hereto as 
part of Appendix B, and entered into the record as part of the response to 
the Final Office Action). 

Similarly, as noted in Logic Synthesis Preserving High-Level Specification: 

"Let S be a single output combinational circuit of multi-valued blocks 
specified by a directed acyclic graph H. The sources and the sink of H 
correspond to primary inputs and the output of S. Each non-source node of 
H corresponds to a multivalued block computing a multi-valued function 
of multivalued arguments. Each node of n of H is associated with a multi- 
valued variable A. If n is a source of H, then the corresponding variable 
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specifies values taken by the corresponding primary input of S, If n is a 
non-source node of S then the corresponding variable describes the values 
taken by the output of the block specified by n. If n is a source 
(respectively the sink), then the corresponding variable is called a primary 
input variable (respectively primary output variable)." Logic Synthesis 
Preserving High-level Specification, E.Goldberg (Cadence Berkeley 
Labs, USA), located at http://eigold.tripod.com/papers/iwls-20Q4.pdf 
(attached hereto as part of Appendix B, and entered into the record as part 
of the response to the Final Office Action). 

Thus, as to claim 17, Beausang-1 does not teach either "primary inputs" (e.g., 
only a single primary input 307 is shown in FIG. 3 A), nor "primary outputs" (none are 
shown) as set forth in the claim (i.e., a path comprising a plurality of first nodes selected 
fi*om a group consisting of latches and primary outputs and a plurality of second nodes 
selected from a group consisting of latches and primary inputs, where the path can 
include a plurality of latches between the plurality of first nodes and the plurality of 
second nodes"). 

Finally, while it is asserted in the Office Action that Beausang-1 teaches the 
combination of "a plurality of first nodes (Figure 15A . . .), with a plurality of output 
latches (Figure 3B)," the Appellant can find no logical connection between these two 
elements within the bounds of Beausang-1. The cited portions of this reference (FIGS. 
13B, 15 A, and 3B) are directed to a scan insertion process to (i) move a load from a Q 

logic output to a g logic output to adjust the load input phase; (ii) move a load to a 
logically-equivalent driving input so the original driver can be downsized; and (iii) 
replace HDL-specified, non-scan memory cells with DFT scannable memory cells. See 
Beausang-1, Col. 6 line 15 - Col. 7, line 6; and Col. 24, line 58 - Col. 25, line 24. Thus, 
it appears that the cited combination simply does not exist according to the teachings of 
Beausang-1. 

It is respectfully noted that "[t]he identical invention must be shown in as 
complete detail as is contained in the ... claim." Richardson v. Suzuki Motor Co., 868 
F.2d 1226, 1236, 9 USPQ2d 1913, 1920 (Fed. Cir. 1989); MPEP § 2131 (emphasis 
added). Since what is disclosed by Beausang-1 and Beausang-3 is not identical to the 
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subject matter of the embodiments claimed, the rejection of claims 10 and 16-18 under § 
102(b) is improper. Reconsideration and allowance are respectfully requested. 

73.2 The Rejections Under S 103: 

Claims 1, 2 and 21 were rejected under 35 USC § 103(a) as being unpatentable 
over Beausang-1 in view of Beausang-2. Claim 14 was rejected under 35 USC § 103(a) 
as being unpatentable over Beausang-1 in view of Beausang-3. First, the Appellant does 
not admit that Beausang-1, Beausang-2, or Beausang-3 are prior art, and reserves the 
right to swear behind these references in the future. Second, since a prima facia case of 
obviousness has not been established in each case, the Appellant respectfully traverses 
these rejections. 

No proper prima facie case of obviousness has been estabUshed because (1) 
combining the references does not teach all of the limitations set forth in the claims, (2) 
there is no motivation to combine the references, and (3) combining the references 
provides no reasonable expectation of success. Each of these points will be explained in 
detail, as follows. 

Combining References Does Not Teach All Limitations: First, with respect to 
independent claims 1 and 21, no combination suggested in the Office Action renders all 
of the claim limitations. It is admitted by the Office that Beausang-1 does not disclose 
"partitioning the pluraUty of extended latch boundary components" as claimed by the 
Appellant. Neither does Beausang-2. 

Beausang-2 is directed to a synthesis process of scan insertion/replacement and 
routing. See Beausang-2, Col. 31, lines 48-56. The Appellant was unable to find any 
reference within the bounds of Beausang-2 to partitioning "extended latch boundary 
components." Rather, the figures cited in the Office Action (Beausang-2, FIGs. 2A and 
6A-14B) illustrate various portions of scan chains; partitioning extended latch boimdary 
components to serve any of the purposes noted by the Appellant in the Application is not 
taught. The text cited in the Office Action (Beausang-2, Col. 26, line 50 - Col. 27, line 
17) refers to selecting "partition blocks" to balance scan chains, and not to parititioning 
extended latch boundary components. In Beausang-2, the logic cells of the database 



APPELLANTS' BRIEF ON APPEAL Page 15 

Serial Number: 09/347,690 Dkt: 884.107US1 (INTEL) 

Filing Date: July 2, 1999 

Title: LOGIC VERIFICATION IN LARGE SYSTEMS 
Assignee: Intel Corporation 

remain unaltered. Thus, independent claims 1 and 21 are nonobvious. This conclusion 
applies with even greater force respecting dependent claim 2, since any claim depending 
from a nonobvious independent claim is also nonobvious. See M.P.E.P. § 2143.03. 

Second, with respect to independent claim 14, the Office admits that Beausang-1 
does not disclose "grouping a plxxrality of extended latch boundary components into a 
plurality of partions", as claimed by the Appellant. Neither does Beausang-3. 

Beausang-3 is also directed to a synthesis process of scan insertion/replacement 
and routing. See Beausang-3, Col 32, lines 22-42. The Appellant was imable to find any 
reference within the bounds of Beausang-3 to "grouping a plurality of extended latch 
boundary components into a plurality of partions." Rather, the cited text of Beausang-3 
(Beausang-3, Col. 12, lines 45-55) serves to define scan groups, scan links, and scan 
chains; grouping extended latch boundary components to serve any of the purposes noted 
by the Appellant in the Application is not taught. Thus, independent claim 14 is 
nonobvious. 

No Motivation to Combine References: The Office asserts that one would be 
motivated to combine Beausang-1 with Beausang-2 or Beausang-3 so that a designer 
would be able to "sign off his or her work at the completion of module design, without 
later disruption. See Office Action, Paper 10-24-04, Pg. 8. However, as pointed out in 
the Office Action, Beausang-1 aheady provides an enabling solution. 

The assertion is made by the Office that, nevertheless, one of skill in the art 
"would have been motivated to find a better solution to decrease the amount of time 
required to compile a design." It is fiirther asserted that it would therefore have been 
obvious to combine Beasang-1 with either Beausang-2 or Beausang-3 so that "the IC 
designer can now 'sign off his or her work at the completion of the module design" so 
that completed/optimized modules do not later have to be disrupted. However, the 
Appellant do not understand that enabling designer sign-off necessarily satisfies the 
motivation noted in the Office Action. In fact, the combination may require extra time to 
finalize individual module designs, potentially creating a longer compilation time, since 
each module would then have to be perfected by its individual designer prior to top level 
analysis. See Beausang-2, Col. 3, lines 3-11 and Beausang-3, Col. 3, lines 4-12. 
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Beausang-1 teaches the need for a "time efficient design synthesis system operable within 
a computer implemented CAD system . . .". Beausang-1, Col. 4, lines 22-24. 
Implementing the combination suggested by the Office would operate to increase the 
time required to synthesize designs, by removing them from the computer-implemented 
realm. This contradicts the express intention of Beausang-1, and thus, provides no 
motivation to combine Beausang-1 with either Beausang-2 or Beausang-3. 

Finally, it is noted in the Office Action that all three references have the same 
inventor, James Beausang, and that this somehow constitutes a motivation to combine 
them. This assertion disregards the fact that many inventors create mechanisms that have 
no relation to one another. Further, combining references in a way that prevents 
achieving the goals expressed in those references does not constitute a motivation to 
combine them. 

Since Beausang-1 teaches away fi^om the suggested combinations, there is no 
motivation to make them. In addition, the unsupported assertions in the Office Action 
fail to satisfy the explicit requirements needed to demonstrate motivation as set forth by 
the In re Sang Su Lee court. Therefore, the Examiner appears to be using personal 
knowledge, and is again respectfiiUy requested to submit an affidavit as required by 37 
C.F.R. § 1.104(d)(2). To date, no such affidavit has been received. 

No Reasonable Expectation of Success: As has been previously noted, 
modifying Beausang-1 to implement independent completion of modules by various 
designers may create additional barriers to overall design completion, without providing a 
"system that can reduce the time required to perform circuit synthesis ..." See 
Beausang-1, Col. 4, lines 18-19. Introducing a human element into an electronic design 
process rarely speeds anything up; in fact, "[djesign, checking and testing of large scale 
integrated circuits are so complex that the use of programmed computer systems are 
required for realization of normal circuits." Beausang-1, Col. 1, lines 21-23. 

In addition, as noted above, several elements of claims 1-2, 14, and 21 are not 
taught by any of the cited references. Thus, there is no reasonable expectation that any 
combination of Beausang-1, Beausang-2, and Beausang-3 will provide these elements, 
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including "decomposing", "partitioning", and "grouping" extended latch boundary 
components, as claimed by the Appellant. 

The test for obviousness under § 103 must take into consideration the invention as 
a whole; that is, one must consider the particular problem solved by the combination of 
elements that define the invention. Interconnect Planning Corp, v. Feil, 11 A F.2d 11 32, 
1 143, 227 U.S.P.Q. 543, 551 (Fed. Cir. 1985). References must be considered in their 
entirety, including parts that teach away from the claims. See MPEP § 2141 .02. The fact 
that references can be combined or modified does not render the resultant combination 
obvious unless the prior art also suggests the desirability of the combination. In re Mills, 
16 USPQ2d 1430 (Fed. Cir. 1990); M.P.E.P. § 2143.01. 

Therefore, since there is no evidence in the record to support disclosure by either 
Beausang-1, Beausang-2, or Beausang-3 of "decomposing", "partitioning", and 
"grouping" extended latch boundary components, since there is no motivation to combine 
the references (e.g., since the references teach away fi"om such a combination), and since 
no reasonable expectation of success arises, a prima facie case of obviousness has not 
been established with respect to independent claims 1,14, and 21. This conclusion also 
applies to dependent claim 2, since any claim depending fi-om a nonobvious independent 
claim is also nonobvious. It is therefore respectfiilly requested that the rejections of 
claims 1-2, 14, and 21 xmder 35 U.S.C. § 103 be reconsidered and withdrawn. 
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8, SUMMARY 

It is respectfully submitted that no prima facie case of anticipation under 35 
U.S.C. §102, nor of obviousness under 35 U.S.C. §103 has been estabhshed by the 
Office. Therefore, it is respectfully requested that the rejections of claims 1-2, 10, 14, 16- 
18, and 21 be reconsidered and withdrawn. The Appellant respectfully submits that all of 
the claims are in condition for allowance and notification to that effect is eamestly 
requested. The Examiner is invited to telephone the Appellant's attomey, Mark MuUer at 
(210) 308-5677, or the undersigned attomey at (612) 349-9592, to facilitate prosecution 
of this Application. If necessary, please charge any additional fees or credit overpayment 
to Deposit Account No. 19-0743. 

Respectfully submitted, 

MANPREET S. KHAIRA ET AL. 

By their Representatives, 

SCHWEGMAN, LUNDBERG, WOESSNER & KLUTH, P.A. 
Attomeys for Intel Corporation 
P.O. Box 2938 

Minneapolis, Minnesota 55402 
(612) 373-6970 
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CLAIMS APPENDIX 

The Claims on Appeal. 

1 . (Original) A method of preparing a circuit model for simulation comprising: 
decomposing the circuit model having a number of latches into a plurality of 

extended latch boimdary components; and 

partitioning the plurality of extended latch boundary components. 

2. (Original) The method of claim 1, wherein decomposing a circuit model having a 
number of latches into a plurality of extended latch boundary components comprises: 

decomposing at least one of a plurality of hierarchical cells into one of the 
plurality of extended latch boundary components. 

10. (Previously Presented) A method of preparing a circuit model for simulation, the 
circuit model having a model size, comprising: 

merging a plurality of extended latch boundary components into a plurality of 
partitions having a partition size; and 

maintaining a load balance within the plurality of partitions. 

14. (Original) A method of preparing a circuit model for a simulation having a total 
simulation time, the method comprising: 

grouping a plurality of extended latch boundary components into a plurality of 
partitions; and 

reducing the communication time within the plurality of partitions by adjusting 
the grouping. 

16. (Original) A method of forming an extended latch boundary component 
comprising: 
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selecting a path having a first node selected from a group consisting of latches and 
primary outputs and a second node selected from a group consisting of latches and 
primary inputs, wherein the path can include a latch between the first node and the 
second node. 

17. (Original) A latch boundary component comprising: 

a path comprising a plurality of first nodes selected from a group consisting of 
latches and primary outputs and a plurality of second nodes selected from a group 
consisting of latches and primary inputs, where the path can include a plurality of latches 
between the plurality of first nodes and the plurality of second nodes. 

18. (Original) A method of sharing a repeated circuit structure in a circuit model, the 
method comprising: 

expanding the repeated circuit structure once to form an expanded circuit 
structure; and 

grafting the expanded circuit structure to the circuit model as needed. 



21 . (Original) A method of simulating a circuit model, the method comprising: 

partitioning a plurality of extended latch boundary components to form a plurality 

of partitions having a size; 

preparing a plurality of simulations from the plurality of partitions; and 
executing the plurality of simulations on a processing unit. 
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EVIDENCE APPENDIX 



The evidence noted below and attached hereto include copies of documents 
submitted as an integral part of the Response by the Appellant to the Final Office Action 
mailed by the Office on November 3, 2004, and presented for consideration by the Examiner 
when preparing the Advisory Action mailed February 28, 2005. 

INSERT #1: WIREPLANNING IN LOGIC SYNTHESIS, 8 pgs. 

INSERT #2: LOGIC SYNTHESIS PRESERVING HIGH-LEVEL SPECIFICATION, 8 
pgs. 
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RELATED PROCEEDINGS APPENDIX 



NONE. 
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Logic synthesis preserving higli-level specification 



E.Goldberg (Cadence Berkeley Labs, USA), 



Abstr&Ct. In this paper we develop a method of logic synthesis 
that preserves high-level structure of the circuit to be 
synthesized. This method is based on the fact that two 
combinational circuits implementing the same "high-level" 
specification can be efficiently checked for equivalence. Hence, 
logic transformations preserving a predefined specification can 
be made efficiently. We introduce the notion of toggle equivalence 
of Boolean functions and show that toggle equivalence can be 
used for making gate level transformations that preserve a 
predefined specification. We describe a practical procedure for 
checking toggle equivalence of two Boolean circuits and give 
experimental data about its performance. 

1. Introduction 

In a typical design flow, by the time a circuit is passed 
to a logic synthesis procedure, the "high-level" structure of 
this circuit is lost. Suppose, for example, that a 
combinational circuit is initially described as a network of 
multi-valued blocks. After encoding all the multi-valued 
variables, this network is replaced with a Boolean circuit 
that is optimized using a set of local transformations that 
ignore the original high-level structure of the circuit. 

The flaw of the approach above is that in the case of a 
poor choice of encodings for multi-valued variables, a 
synthesis procedure using local transformations will not be 
able to "correct*' these encodings. On the other hand, 
finding good encodings at the level of multi-valued blocks 
is hard and so the probability of generating bad encodings 
is high. A possible solution to the problem is to perform 
logic transformations that re-encode multi-valued variables 
"implicitly" at the gate level. We will call such 
transformations High-level structure aware Logic 
Synthesis (HLLSf). This is because re-encoding of multi- 
valued variables implicitly, essentially means synthesizing a 
circuit that is a different implementation of the same 
"specification" as the original circuit. An HLLS procedure 
can be used as an extra optimization step taken before using 
logic synthesis based on local transformations. 

In this paper we introduce a method of HLLS for 
combinational circuits. To design an HLLS procedure one 
has to solve two problems. The first problem is to verify the 
correctness of logic transformations. A "regular" logic 
synthesis procediire makes local transformations so the 
equivalence checking of the original and optimized circuits 
is usually not an issue. On the other hand, an HLLS 
procedure performs **non-locar' synthesis transformations, 
so verification of the correctness of such transformations 



may pose a problem. This problem was addressed in [3], [4] 
where it was shown that if two Boole^ circuits have a 
common specification (CS), their equivalence checking is 
"easy". Informally, circuits Nx and have a CS if they can 
be considered as two different implementations of a circuit 
S of multi-valued gates further referred to as blocks. (5 is 
called a CS of Nx and A^2). In [3][4] it was proven that 
given a CS 5 of circuits Nx and Ni, there is an equivalence 
checking procedure whose conq)lexity is linear in the 
number of blocks of S and exponential in the granularity of 
S (the "size" of the largest block of S). An exan^le of 
circuits ^1,^2 having a common specification of three 
blocks is shown in Fig. 1. The specification itself is shown 
on the left. Here Nx{G}^ and ^2(Gy are subcircuits of AT, 
and ^2 respectively in^)Iementing the same block of 
specification. 
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Figure 1 Circuits and with a common specification of 
three blocks 

The second problem one has to solve in HLLS, is to 
find a way to preserve a predefined specification by 
making transfonmations at the gate level. Solving this 
problem is the focus of this paper. We introduce the 
notion of toggle equivalence of multi-output Boolean 
functions that is a generalization of regular functional 
equivalence. We show that circuits ^1 and Nt have a CS if 
they can be partitioned into toggle equivalent subcircuits 
(see Section 4.). This result suggests a simple way to 
perform HLLS. Suppose that a specification S of circuit ^^^i 
to be optimized is specified by partitioning the latter into 
subcircuits. Then, if we replace subcircuits of Nx with 
•'better'* subcircuits that are toggle equivalent to replaced 
ones, we produce a circuit that implements the same 
specification S as iV,. To be viable, HLLS needs an efficient 
algorithm for checking toggle equivalence. We describe 
such an algorithm and demonstrate its efficiency on 
benchmark circuits . 

The paper is structured as follows. In Section 2 we 
formally define the notion of a common specification. In 



Section 3 we reformulate the algorithm of equivalence 
checking from [3] removing some redundancy. Section 4 
introduces the notion of toggle equivalence of Boolean 
functions. The relation between the notions of toggle 
equivalence and conunon specification is shown in Section 
5. In Section 6 we describe a method of HLLS. The 
relation of HLLS to other synthesis procedures is discussed 
in Section 7. The description of a procedure for checking 
toggle eqmvalence and its performance on benchmark 
circuits are given in Section 8. Finally, we draw some 
conclusions in Section 9. 



2. Definition of comnion specification 

In this section, we formally define the notion of a 
common specification of Boolean circuits. Let 5 be a single 
output combinational circuit of multi-valued blocks 
specified by a directed acyclic graph H, The sources and 
the sink of H correspond to primary inputs and the output of 
S, Each non-source node of H corresponds to a multi- 
valued block computing a multi-valued function of multi- 
valued arguments. Each node ofnofH is associated with a 
mulii'Vaiued variable A, If n is a source of H , then the 
corresponding variable specifies values taken by the 
corresponding primary input of If n is a non-source 
node of .S^ then the corresponding variable describes the 
values taken by the output of the block specified by n. If n 
is a source (respectively the sink), then the corresponding 
variable is called a primary input variable (respectively 
primary ou^ut variable). We will use the notation 
C=G(i4|^2r- .» ^k) to indicate that a) the output of a block 
G is associated with a variable C; b) the function conqiuted 
by the block G is G{Ai,A2,..., ^0; c) only k nodes of H axe 
connected to the node n in H and outputs of these nodes 
are associated with variables ^4 1^2,. . 

Denote by D(A) the domain of the variable A 
associated with a node of H. The value of \D(A)\ is called 
the multiplicity of A. If the multiplicity of every variable A 
ofS is equal to 2 then 5 is a Boolean circuit. 

Now we introduce the notion of a specification of a 
single output Boolean circuit N, Informally, a multi-valued 
circuit 5 is a specification of if can be obtained from S 
by picking proper encodings of internal variables of S, In 
the following exposition any multi-valued network is called 
a specification. 

Definition 1. Let D(A)={a^,,.,^^} be the domain of a 
variable A of S, Denote by q(A) a Boolean encoding of the 
values of D{A) which is a mapping q:D{A)~^{0,l}" such 
that a.^:^a^^ q{a.^ ) ^ q(a. ). The value of q(a, ), a. e D(A) is 
called the code of a.. Denote by length{q{A)) the number of 
bits in q{a^) i.e. the value of m. Denote by v{A) the set of w 
coding Boolean variables. 



In this paper, we make the assumptions below about 
specifications and implementations 
Assumption L A specification contains only one output. 
All primary input variables and the primary output variable 
of a specification are Boolean. 

Assumption 2. Every gate of a Boolean circuit 
(inq)lementation) has two ii^uts and one output. Every 
multi-valued block of a specification has only one output 
but the number of ir^>uts of a block is not fixed 
Assumption 3. If Ai and A^ arc two different variables of a 
specification , then v(i4i) K^j) =0. 
]>efinition2. Let OGiA.Aj^ .Ad a block of 

specification S. Let q{A^,.,,q{A^,q{C) be encodings of 
variables A^A^,.,yA^ and C respectively. A Boolean circuit 
N is said to implement the block G xf N implements the 
completely specified Boolean function 
(^C)=G{q{A^),..,q{AJ) whose truth table is obtained from 
that of G by replacing values of A^.^JC with their codes. 
Definition 3. Let 5 be a multi-valued circuit. A single 
output Boolean circuit N is said to implement the 
specification 5, if N can be built firom S by the following 
two rules. 

1) Each block G of ^ is replaced with its implementation 
(denote it by N(G)). 

2) Let the output of block G\ (specified by variable C) be 
connected to an input of block G2 (specified by the same 
variable Q in S, Then the outputs of the circuit N{Gi) are 
properly connected to inputs of N(Gt). Namely, if a primary 
output of N{G\) connected to an input of NiGz) these input 
and output are specified by the same coding variable of 

Remark 1. It is important to emphasize that the fact that a 
circuit has a specification S does not necessary mean that 
N is produced from S by encoding it multi-valued 
variables. It just means that N can be produced from S. 
Remark 2. Let N be an implementation of a specification 
S. Let p be the largest number of gates used in an 
implementation of a multi-valued block of S in N. We will 
say that 5 is a specification of granularity p for M. 
Definition 4. Let N^, be two functionally equivalent 
single ou^ut Boolean circuits. Let implement a 

specification S. Then .S^ is called a common specification 
(CS) ofA^.andyVj. 

Definitions. LeiS be a CS of A^„Ar,. Let (respectively 
be the granularity of S with respect to (respectively 
N^. Then we will say that 5 is a CS of N^J^^ of granularity 
p = maxip,^;). 

3. Equivalence checking with a known 
common specification 

In this section, we recall the equivalence checking 
algorithm of [3] and give a slightly modified version of it. 



Let Ni and 1^2 be Boolean circuits with a CS 5. Let Gk be a 
block of 5. We will denote by N^iG^ and NjiC;) the 
implementations of the block in Ni and Ni respectively. 
Definition 6. Let 5 be a CS of and The topological 
level of a block in a specification S is the length of the 
longest path from a primary input of 5 to G^. (The length of 
a path is measured in the number of blocks on it. The 
topological level of a primary input is assumed to be 0.) 
Denote by level(GJ the topological level of Gk in S, Denote 
by level(NiG)J) the topological level of implementation of 
block Gk in /=1,2 that is assumed to be equal to 
leveliGO- 

Definition 7. Let N^G^) and N/^G^) be implementations of 
a multi-valued block G^ whose output is associated with 
variable C. Let q^{Q and gj[Q be encodings of the variable 
C used in implementations N,(G^) and Nj(^G^). Function 
0(^,(0,^2(0) is called a correlation function of 
encodings q^Q^q^JO) if 

a) Q(z„ for any assignment z, to v,(0 and to Vj(0 
such that z^=qfc) andz,=^j(^) where c e Z)(C). 

b) Otherwise zj=0. 

Definition 8. Let be a Boolean circxiit. Denote by v{N) be 
the set of Boolean variables associated with the output of 
gates of N . Denote by Sat{y{N)) the Boolean function such 
that Sat{z)==l iff the assignment z to variables v{N) is 
''possible" i.e consistent. For example, if circuit N consists 
of just one AND gate y=x^ a x^, then v{Ny=^{y,x^yX^} and 
Sat{v(N))= (-jf , V V A (jc, V -y) a (x, v -y). 
Definition 9. Let /be a Boolean function. We will say that 
function y* is obtained from /by existentially quantifying 
away variable x ifJ*=J{... ;c=0, . . . ) v y( . . . ^ 1 , . . . .). 

In [3] it was shown that if and ^2 ^ve a CS one 
can check them for equivalence in the time linear in the 
number of blocks of S and exponential in the granularity of 
S. The essence of that algorithm is to compute so-called 
filtering and correlation functions in topological order. Here 
we give a modified version of this algorithm. The 
modification is that we discard computation of filtering 
functions from the algorithm. 

Here is an informal proof that confutation of filtering 
functions is not necessary. Let C be the variable associated 
with the output of a block Gk of S, From definitions of 
filtering (denoted by FJ) and correlation functions given in 
[3] it follows that 

So filtering fimctions can be dropped from the proof of 
Proposition 7 of [3] used to formulate the main result i.e. 
Proposition 8. (The use of filtering functions makes sense 
though, if one relaxes the definition of a conrelation 
function. However, in this paper we stick to the definition 
of a correlation function given in [3].) 



In the modified algorithm, only correlation functions 
are conq^uted in topological order of Ni and ^2- The 
algorithm starts with block implementations Ni(G)dJ^2(G}) 
of level 1 then process implementations of level 2 and so 
on. Let teve/(A^,(G0hievel(iN^2(Gic))=l (i.e. inputs of A^,(Gk) 
and A^2(<a) are primary inputs of Ni and Ni.) Let C be the 
variable associated wi^ the ou^ut of G^ and 
Q(vi(0,V2(0) be the correlation function relating 
encodings gi(C) and ^2(0- This function is obtained from 
the function Sat(v(Ni(G}J) a 5'a/(v(^2(^A)) by existentially 
quantifying away all the variables except the variables 
associated with the outputs of Ni{G^ and Nz^G}). 

Suppose level{Nx{Gv))^level{Ni(G)^)=k and the 
correlation fimctions have been computed for the 
implementations of levels less than k. Let the output of G^ 
be associated with variable Ck. Let the inputs of G^ be 
connected to the outputs of blocks Gtci,...,Giun associated 
with variables C)cu..,C|a„ respectively. Then the correlation 
function Q(vi(Ck),V2(Cit)) is obtained from the function 
Q(v,(Ck,),V2(Ck,)) A .. a Q(v,(Cfa„),V2(CkJ) A Sat(y{N,{G^) 
A •S'ar(v(^2(Gk))) by existentially quantifying away all the 
variables except ones associated with the outputs of Ar,(Gk) 
and Ni{G^. Eventually the correlation function Cfifi/2) is 
conq)Uted where/ and>2 are Boolean variables specifying 
the outputs of A^i and N2. If CM/2) = (/iv -/j) a (-/v/) 
(which is the equivalence function), then circuits ^1 and ^2 
are functionally equivalent. 

Definition 10* Given two functionally equivalent Boolean 
circuits N^, S is called the finest common specification 
if it has the smallest granularity p among all the CSs of N^ 

The complexity of the described algorithm is 
exponential in the granularity of S (i.e. in the size of the 
maximal subcircuit Ni(G}^). So to evaluate the con^lexity 
of equivalence checking of Ni and ^^2 correctly, one needs 
to know the finest CS of ^1 and ^2 or a CS whose 
granularity is close to that of the finest one. 



4. Toggle equivalence of Poolean functions 

In this section, we introduce the notion of toggle 
equivalence. We also show that toggle equivalent Boolean 
functions can be considered as different implementations of 
the same multi-valued function. 

Definition 11. Let/;:{0.1}" ^ {0,1}" and /, {0,1}" ^ 
{0,1}^ be m-output and ^-output Boolean functions of the 
same set of variables. Functions/ and / are called toggle 
equivalent if /(jc) ?t/(jcO <=> /,(jt) ^/.(x'). Circuits //, and 
implementing toggle equivalent functions / and / are 
called toggle equivalent circuits. 



Remark 3. Toggle equivalence means that for any pair of 
input vectors jc, x' for which at least one output of ft 
''toggles", the same is true for^ and vice versa. 
Deflnition 12, Let /be a multi-output Boolean function ofn 
arguments. Denote by Part{f) the partition of the set {0,1}" 
into blocks 5,,...^^ such thsXfix) -=J[x') if jr, jc'are in the 
same block mdfix) ^Jixf) if jc, jc 'are in different blocks. 
Proposition 1. Let / and /, be toggle equivalent. Then 
Part(f,)=Part{f^ i.e. for each block of Part(f^) there is a 
block B*^ of Part(Q such that B.^B ] and vice versa. 
Proof. Assume the contrary i.e. ParHfx) ^ ParUfi), Then 
there is a block Bi of Part{fi) such that no block B] of 
Part^fz) is equal to Bi. Then only the two cases below are 
possible. 

a) Bi contains at least two input vectors. Then there is 
a pair of vectors jc^c' of block Bi such that they are in 
different blocks of Part{fi). This means that/,(x>=/i(:c' ) 
while /2(j:) 5t/2(jO i e/i and/j are not toggle equivalent. 
So we have a contradiction. 

b) Bi contains only one input vector x. Let B'j of 
Partifi) contain vector x. Block ^'j also contains at least 
one more input vector because otherwise B^Wy So the 
block contains two input vectors that are in different 
blocks of part(fiy Hence fi and /j are not toggle equivalent 
and we again have a contradiction □ 

Proposition 2. Let / and /, be toggle equivalent single 
output Boolean functions. Then f~f^ or where 
means negation. 

Proof, From Proposition 1 it follows that Part{fiyPart{fi). 
Since /i, fi are Boolean functions, Part(fi) and Partifj) 
contain two blocks each. So the only two adtematives are 
/i=/2or/i=V2. 

Proposition 3. Let / and f be two multi-output Boolean 
functions of n Boolean variables such that Par^^^?aIt(f^, 
Then/ and/ are toggle equivalent. 
Proof can be performed by reasoning in the same way as in 
the proof of Proposition 1. 

Proposition 4. Let F be a multi-valued function of n 
Boolean variables. Let C be the multi-valued variable 
specifying the output of F. Let / and / be Boolean 
functions obtained from F by using encodings and (of 
possibly different length) for the values of C. Then/ and/ 
are toggle equivalent. 

Proof According to Definition 1 different values of C are 
assigned different codes. So Part(fx)=Part{fi)=Part{F) 
where Part(F)^{Bx,,.,,B),} and Bi consists of all the input 
vectors for which F takes the same value of C. From 
Proposition 3 it follows that / and / are toggle equivalent. 

Proposition 5, Let/ and / be toggle equivalent. Then/ 
and / are two different "implementations" of the same 
multi- valued function of Boolean variables. 



Proof According to Proposition 1 Part(fi)=Part(ft). Let 
Port(ft)J*art(f2) contain k blocks. Then / and /j are 
implementations of the function F: {0,1}° -> {l,..,jt} where 
f(x)=m, iff JC is in the m-th block of Part{fx\ 

So far we have considered toggle equivalence of 
functions with identical sets of arguments. Below, the 
notion of toggle equivalence is extended to the case of 
Boolean circuits with different sets of arguments that are 
related by encoding functions. 

DeHnition 13. Let {jc,....;cj and y= {y^,..., yj be two 
disjoint sets of Boolean variables. Denote by Enc{XyY) a 
Boolean function satisfying the following two conditions 

1) . There do not exist three vectors jc, jc*, y (where Jc, jc' are 
assignments to variables X and y is an assignment to 
variables Y ) such that jc jC' and Enc{x, y)=Enc(x^, y)= 1 . 

2) There do not exist three vectors jc,y,y' such that y?*/ 
and Enc{xoO^Enc(x, y')=\. 

The function Enc is called an encoding function. 

Remark 4. An encoding function can be viewed as 
specifying two different encodings of the same multi-valued 
variable. Indeed, let r={xi....^k} be the set of all 
assignments to variables of X such that Enc(Xh y)=l for 
some y. Let IV^{yu.., y^} be the set of all assignments to 
variables of Y such that Enc(x, y{) = 1 for some jc. It is not 
hard to see that from Definition 13, it follows that \J^\V\ 
and there exists a "natural" bijective mapping between X 
and Y that relates a vector JC| of V and the vector y^ of W for 
which £/icOci, yj)=l . Vectors JC| and y^ can be considered as 
codes of the same value of a Jfc-valued variable. 
Definition 14. Let /:{0.1}" ^ {0,1}" and / {0,1}" ^ 
{0,1} be m-output and ^-ou^ut Boolean functions and sets 
v^f={x„...,xj and >'={>'„..., yj specify their arguments. Let 
X={X,,..^^} and y={>'„ ..»J',} be partitions of ^ and yintoj 
blocks and EnciX^,Y^,, . ., Enf^X^,Y^ are encoding functions. 
Functions/ and / are called toggle equivalent under input 
encoding function Enc(X,Y)=Enc{X,J,) a ,.. AEnciX^j;) 
if (ftix)^/Xx') A {Encix^)^Enc(x',y')=l))^(f,(y)^ 
f^iy') and vice versa {fj^) i^f^(y') a {Enc{x^)=Enc(j^, 

Proposition 6. Let / and / be toggle equivalent under 
input encoding function Enc{X,Y)r^n^X^,Y^) a ... 
Enc{X^,Y^ . Then / and / are "implementations" of the 
same multi-valued function of s multi-valued arguments. 
Proof follows from Proposition 5 and Renxark 4. 

5. Common specification and toggle 
equivalence 

In this section, we show that the existence of a CS of 
circuits A^t and ^2 means that ^i,;^2 can be partitioned into 



toggle equivalent subcircuits that are connected in Ni and 
A^2 the same way". 

Definition 15. Let N = (KE) be a DAG representing a 
Boolean circuit (here are sets of nodes and edges of N 
respectively.) A subgraph N^V*, E*) of N is called a 
subcircuit if the following two conditions hold: 
a) if ^i«&2 are in V* and there is a path from g, to in N, 
then all the nodes of that on that path are in ; 

if f 11^2 of ^ are connected by an edge in N, then they are 
also connected by an edge in A^. 

Definition 16. Let be a subcircuit of N. An input of a 
gate g ofN* is called an input ofN* if it is not connected to 
the output of some other gate of iV*. A gate g of AT* is called 
an internal gate if all the gates of A/^ whose inputs are fed by 
the ou^ut of g are in N*, Otherwise, g is called an external 
gate. The output of an external gate is called an output of 
circuit A/*. 

Definition 17. Let A^ be a subcircuit of A/' of ;t inputs and p 
outputs. Let AT be the circuit obtained from N by replacing 
A^ with a ^-input, p-output node that inherits all the 
connections of subcircuit A^ to the gates of ^ that are not in 
A^*. We will say that AT is obtained from N by collapsing 
the subcircuit N*. 

Definition 18. Let {V, be a partition of A^ into 

subcircuits. This partition is called topological if for each 
pair of subcircuits Njf the following condition holds: if 
there is node ^ ofN. and node g" of A^. that are connected 
by a path in A^ and level{g) < level(g'% then for any pair of 
nodes g* and g** (where g* is in AT and g** is in //) it is 
true that level(g*) < ievei(g**) {level(g) is the level of g in 

Definition 19. Let A^be a Boolean circuit and A^, aA be 
a topological partition of A^ into subcircuits. Let f be a 
directed graph obtained from the DAG of N using the 
following steps: 

1) Each subcircuit V is coll^sed in A^(i.e. replaced with a 
node Gj with die same number of inputs and outputs as ^ ). 

2) The outputs of each node G. are merged into one. If two 
(or more) ou^uts of are connected to inputs of node 
then all the inputs of G^ connected to G. but one are 
removed. 

T is called the communication specification 
corresponding to the topological partition A^, A^ . 

Remarks. Informally, T describes information flow 
between subcircuits V. A^ . The output of G; is 
connected to an input of Gk in T iff an output of V is 
connected to an input of M in N, 

Remark 6. It is not hard to show that since A^, A/*^ is a 
topological partition, Tis a DAG (i.e. T is acyclic). 
Definition 20. Let T be the communication specification of 
circuit A^ with respect to a topological partition N*, l^. 



Let G^ be the node of T corresponding to subcircuit A/*. The 
longest path from an input of Tto G. is called the level of G. 
and AT (denoted by level(G) and level{H) respectively). 
Definition 21. Let A^,', ...,A^,'' and N^, be topological 
partitions of single output Boolean circuits N,J^^. Let 
conununication specifications of A^, and A^^ with respect to 
partitions A^,', A^,*' and A^,', A^^'' be identical. Denote 
by CfiN^J^^, m==l,...,A the correlation functions 
computed exactly as it was described in Section 3. 
Namely, we first compute correlation fimctions for 
subcircuits of level 1, then for level 2 and so on. The 
correlation fimction CfiN^^^ is obtained from the 
fimction H^Sat{v{N,^)) a Sat{v{N^)) a Q*, the fimction 
Cy* being the conjunction of correlation functions for all the 
subcircuits N^J^^ whose outputs are connected to inputs of 
A^* , A^,". The fimction CfiM^J^,"') is obtained from H by 
existentially quantifying away all the variables except the 
output variables of A^,", AT^". 

Proposition 7. Lcty; and/ be two Boolean fimctions that 
are toggle equivalent under input encoding function 
EnciX,Y}=-Enc{X,,Y,) a ... a Enc(X,J,) (see Definition 14.) 
Let AT, and be circuits implementing/ and/ and V and 
fTbe the output variables of A/, and A^,. Let CJ{V,W) be the 
fimction obtained from the fimction Enc{X,Y) a Sat(v(NJ) a 
Sat(v{N^) by existentially quantifying away all the variables 
except variables of F and W. Then Cf(V,tv) is an encoding 
function. 

Proof, Assume the contrary i.e. CJ{y,fV) 'is not an encoding 
function. Suppose, for example, that there are vectors v,v',w 
such that V 9t v' and Q(v,w)=Q(v ». Since Cy is obtained 
by existentially quantifying away some variables, there 
must exist vectors z=ix^,.,.,v,w) and z' =(j«^y,...,v» 
such that 

1) input variables of A^i and N2 are assigned x,y 'mz and ;c', 
/ in z* and 

2) ou^ut variables of A^t and N2 are assigned v,w in z and 
v',w' in z*. 

Since v ;t v'. then x^x' . Hence there are vectors jc^>y 
such that X ;e x\ Enc(xj^)-^Encix\ /)=1 , A^i(x)=v,Ar,(x'>=v' 
and N^)=nvJ/2(y)^w. But this means that A^i and N2 are 
toggle inequivalent and so we have a contradiction. 

Proposition 8. Let AT,', .... and A^,', A^/ be topological 
partitions of fimctionally equivalent circuits A', and A^^ • Let 
communication specifications f, and ofN^ and with 
respect to these partitions be identical i.e. TrT~T. Let 
each pair of subcircuits A^" and A^j" be toggle equivalent 
under input encoding function Cf (see Definition 21). 
Then A^, and A^j have a common specification S, The 
topology of S is specified by the communication 
specification Tand A^," and A'^" are implementations of 
w-th block of 5'. 



Froof, By assumption each pair of circuits N^^J^i" are 
toggle equivalent if their inputs are restricted by the 
corresponding correlation functions. According to 
Proposition 6, if the correlation functions are encoding 
functions, then yv," and N-i" inclement the same multi- 
valued function. So one just needs to prove that all 
correlation functions are encoding functions. Let 
level{N{^Hevel{N'^)=-\. Then mpm of A^,"* and N^" are 
common primary inputs of N\ and ^2 • One can view inputs 
of Nx" and N-i" as related by the encoding function that is 
the conjunction of functions describing equivalency of the 
corresponding input variables of and Ai". According to 
Proposition 7, the correlation function relating outputs of 
Nx^ and ^^2™ is an encoding function. Then by induction in 
topological levels one can easily prove that correlation 
function Cj^Nx^J^-^ is an encoding function for any value 
of p. 

6. Amethodof HLLS 

In previous sections we developed some theory that 
allows one to check the correctness of a CS of circuits Nx 
and Ni even if this specification is described implicitly. 
Suppose that Nx and Ni have the same topological 
specification defined by subcircuits iV/, N^^ and Ni, 
^2*^ and corresponding pairs of subcircuits are toggle 
equivalent. Then these two sets of subcircuits give an 
implicit representation of a CS of ^1 and ^2- In this section, 
we use this result to formulate a method of High-Level 
structure aware Lx)gic Synthesis (HLLS). 




Figure 2. An example of HLLS 

Let us describe this method by a simple cxan^Ie. 
Suppose circuit Nx is partitioned into three subcircuits 
iV| , Nx as shown in Figure 2 on the left. Suppose that we 
want to obtain^a circuit Ni with a better performance than 
Nx. This can be done by replacing 'tall" subcircuits Nx\ 
Nx\ Nx^ with "short" and 'Vide" subcircuits Nz^ N-^, Ni^ 
shown in Figure 2 on the right. In the method of HLLS this 
replacement is done in the following way. First we pick a 
subcircuit of topological level 1, say A^/ and synthesize a 



"short" subcircuit Ni^ that is toggle equivalent to The 
same procedure applies to the other subcircuit of 
topological level I. The subcircuit Nx^ is replaced with a 
toggle equivalent subcircuit Ni^^ After we are done with 
level 1, we move to topological level 2 i.e. to the subcircuit 
Mx\ 

However, before the re-synthesis of circuit Nx^ we 
need to compute the necessary correlation functions. The 
inputs of are connected to the outputs of circuits Ni^ 
and Nx^ , so we need to compute the correlation functions 
Cf{Nx^J^i^) and CfiNx^^t), This co^^)utation is done as 
described in Section 3. Then we build a subcircuit ^^2^ that 
is toggle equivalent to the subcircuit The toggle 
equivalence of Nx^ and ^^2^ is confuted not "globally** (in 
terms of primary inputs of Nx and Ni) but locally in terms 
of inputs of Nx^ and ^2^ related by the correlation functions 
CANx^ . N2^) and CJiNx^Mi^), Since subcircuits Nx^ and N-^ 
have only one output, their toggle equivalence means that 
N^ and A^2^ (and hence circuits Nx and ^2) are functionally 
equivalent modulo negation. Since ^1 and ^2 have the same 
topological specification and the corresponding subcircuits 
are toggle equivalent, ^2 is a different implementation of 
the same three-block specification that is inplemented by 
Nx, 

Of course, the described method gives only *thc big 
picture" because the procedure for synthesis of toggle 
equivalent circuits is not described. The generalization of 
the method to a circuit Nx with a specification of an 
arbitrary number of subcircuits is straightforward. The 
subcircuits of a specification of N\ are replaced with toggle 
equivalent subcircuits in topological order. Toggle 
equivalence of subcircuits i^i™ and ^2"* is computed locally 
in terms of inputs of ^Vj" and related by correlation 
functions obtained before. 

7. Relation of HLLS to other synthesis 
procedures 

In this section, we discuss the relation between HLLS and 
existing synthesis procedures based on local 
transformations. The main three differences between such 
procedures and HLLS are shown in Table 1 . The key idea of 
HLLS is to restrict the search to inplementations of a 
predefined specification. The justification of such an 
approach is as follows. Suppose we need to optimize a 
combinational circuit Nx that has a high-level specification 
S represented as a partition of Nx into subcircuits AT,', 
Nx\ One can view these subcircuits as implementations of 
multi-valued blocks Gi,.,Git of S obtained by encoding 
values of variables of S with binary codes (however it does 
not mean that this encoding was performed explicitly). 

It is quite possible that the chosen codes are far from 
optimal and so a much better implementation N2 of S than 



Ni may exist However, a procedure based on local 
transformations, be it don't care based optimization [5] or 
SPFDS [6][7], will not be able to find Ni. This is because to 
produce a different implementation of S one has to perturb 
many nodes of Ni at once (when replacing a subcircixit ° 
with its toggle equivalent counterpart). On the other hand, it 
is unlikely that by local transformations one can produce a 
circuit (that is not an implementation of 5) "better** than N2. 



Table 1. Comparison of HLLS and procedures based on local 
transformadons 



Synthesis based on 
local transformations 


HLLS 


Circuit is optimized 
node by node 


Many nodes are perturbed "at 
once" 


Equivalence che- 
cking is not an issue 


Reducing complexity of equi- 
valence checking is crucial 


The search space is 
not restricted 


One considers only imple- 
mentations of a predefined 
specification. 



Of course, the conjecture that the quality of encodings 
matters so much is based on the assumption that the 
specification S captures essential features of the circuit to 
be in^Iemented. An example, of a circuit with a 
"meaningful" specification is a multiplier. On the other 
hand, if Ni has many specifications, sticking to one of 
them does not make much sense. 

It is not hard to see that HLLS and synthesis 
procedures based on local transformations are 
complementary and so can be used together. An HLLS 
procedure can be used as a first step of logic optimization 
meant to in^rove encodings of multi-valued variables and 
so to generate a better starting point for a procedure based 
on local transformations. 



8. A procedure for testing toggle equivalence 

Checking toggle equivalence is a key operation of the 
method of HLLS described in Section 6. In this section, we 
describe a procedure for testing toggle equivalence and give 
some experimental results. Let A^i and be two multi- 
output Boolean circuits. Denote by inp{Ni),inp(Ni) and 
out[Ni),out(N2) the set of primary input and output 
variables of circuits ^1 and A/i. Denote by 
Enc{inp{N\\inp{N2f) a function specifying allowable 
combinations of inputs for N\ and N2, Let ^1 and N*2 be 
copies of circuits N\ and N2 respectively that depend on 
different sets of variables (shown in Fig. 2.) Denote by H 
the function Sat{v{N{)) a Sat{v{N2)) a 5'a/(v(N»,)) a 
Sat(v(N^2)) A Enc(inpiNx)Jnp(N:i)) a 



Enc(inp(N*i)Jnp(N*2)). Denote by Eq(putiNx),oxit(N*i)) 
the function that is equal to 1 iff out{Ni) and out(N*i) take 
the same value. Denote by Neq{out{N\),aut{N*\)) the 
function that is equal to 1 iff out{N\) and o«/(iV*,) take 
different values. Denote by Hy the function H a 
Eq(autiNi),out(N*x)) a Neq{out{N{i,out{N*{ji). Denote by 
H2 the function H a Neq{out{Nx\o\xX{N*\)) a 
Eq{out{N2)fiv^N*ii). 

Proposition 9. Circuits and are toggle equivalent iff 
the functions i/, and (defmed above) are constant 0. 
If part. Assume the contrary i.e. Hx and H2 are constants 0 
and N\ and N2 are not toggle equivalent. Suppose there are 
vectors {pcjt) and (jc*,A*) such that Nx{x) = M\(x*) while 
Niik) ^ N2ih*) where Enc{x,h)-^Enc(x*,h*)=\, Then there 
is a vector w that satisfies the function Hx (and so we have 
a contradiction). Indeed, let w be the set of assignments to 
the variables of circuits NxJ^2J^i*J^2 under the input 
assignment (xjt,x*,h*). The assignment w satisfies 
Eq{putiNx\outiN*x)) because Nx{x) - Nx{x*). It also 
satisfies Neq(out{N2),out(N*2))y because Niih) 9^ Niih*). 
Besides, w satisfies function H (because 
Enc(xJi)==Enc(x*Jt*)=\ and w is picked in such a way that 
it satisfies Sat(v(Ni)), Sat{v(N^) . 5fl/(v(iV*,)) , Sat{v{N*2))' 

Only If part. Assume the contrary i.e. circuits and N2 
(and hence N*x and N^2) are toggle equivalent but either Hx 
or H2 (or both) is satisfiable for some assignment of values. 
Suppose for exan^^le that there is an assignment w such that 
Hx(w)=l, Let jc, A, x*Ji* be the parts of vector w that are 
assignments to the input variables of ^i,A^2»^i»^2 
respectively. Since vector w satisfies 5a/(v(W,)), 5a/(v(Ay). 
5a/(v(Ar*|)) , 5'fl/(v(W*2))., then the rest of the assignments in 
w are consistent with a:,A, x*Ji* i.e. they are the values of 
gates of NiyN2J^iJ^2 under the input vector xji^,h* 
respectively. Besides, since Enc(x,h)=^ 

Enc{x*Ji*yEq{put{Nxlout(N*x))A Neq{out{N2\out{N*2)) 
-1 , then Nx{x) = Nx{x^) while Niih) ^ Niih*). This means 
that Nx and ^2 are not toggle equivalent and so we have a 
contradiction. 



Figure 3 Two copies of circuits yv, and Ni to be checked for 
toggle equivalence 



In the experiments we used 39 circuits from the 
MCtiC benchmark set. Namely, for each circuit Ni listed 
in Table 2 we checked its toggle equivalence with a circuit 
N2 produced from by a random pennutation of outputs. 
Clearly, permutation of outputs destroys functional 
equivalence of N\ and ^2 but preserves dieir toggle 
equivalence. To check if Ni and A^2 are toggle equivalent 

Table 2. Toggle equivalence checking for MCNC circuits 



name 


#inputs 


#outputs 


time 
(sec . ) 


pclerS 


27 


17 


0.01 


frql 


28 


3 


0.03 


set 


19 


15 


0.04 


unreq 


36 


16 


0 .04 


lal 


26 


19 


0 .05 


c8 


28 


18 


0 .07 


cht 


47 


36 


0 .08 


b9 


41 


21 


0 . 09 


ray adder 


33 


17 


0 . 16 


example2 


85 


66 


0 .17 


C432 


36 


7 


0 . 18 


apex? 


49 


37 


0 . 18 


vda 


17 


39 


0 . 18 


ttt2 


24 


21 


0.33 


15 


133 


66 


0 .40 


i6 


138 


67 


0 . 63 


term! 


34 


10 


0 .74 


i7 


199 


67 


0.99 


i9 


88 


63 


0.99 


K2 


45 


43 


1.46 


apex6 


135 


99 


1.58 


X4 


94 


71 


1.58 


x3 


135 


99 


1.70 


xl 


51 


35 


2.28 


C499 


41 


32 


2.41 


rot 


135 


107 


2.97 


C880 


60 


26 


5.07 


frq2 


143 


139 


5.13 


C1355 


41 


32 


6.48 


pair 


173 


137 


9.22 


des 


256 


245 


40.39 


C1908 


33 


25 


47.10 


too large 


38 


3 


70.9 


18 


133 


81 


150.02 


CS315 


178 


123 


193.10 


C3540 


50 


22 


261.28 


dalu 


75 


16 


310.67 


110 


257 


224 


588.87 


C7552 


207 


108 


6,122.5 


Geometric mean 


1.84 


Arithmetic mean 


200.77 



we created CNFs describing functions Ht and H2 and 
checked them for satisfiability. (Since tested pairs N^^Ni 



were toggle equivalent, all generated CNF formulas were 
unsatisfrable.) For satisfiability testing we used the SAT- 
solver BerkMin[l],[2]. The runtimes are shown in the last 
column of Table 2. It is not hard to see that in the majority 
of cases, toggle equivalence was established very quickly 
which proves that the proposed procedure may be used in 
HLLS. 



9. Conclusions 

We introduce a method of High-Level structure aware 
Logic Synthesis (HLLS). The key idea of the method is to 
re-encode multi-valued variables of the specification 
describing the high-level structure of the circuit implicitly 
at the gate level. We show that this can be done by 
preserving toggle equivalence among initial and re- 
synthesized inr^lementations of multi-valued blocks. We 
show that toggle equivalence can be efficiently checked for 
relatively large pieces of combinational logic, which makes 
HLLS a promising direction for future research. 
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Abstract 

In this paper, we propose a new logic synthens me&odology to deal 
with tluB increasins impoitance of the interconnect delay in deep- 
subndcfon technologies. We fiist show that conventional logic syn- 
thesis techniques can produce dicuits v/iddi mil have long paths even 
if placed optimally. Then, we characterize the conditions under which 
this can happen and propose logic synthesis techniques wluch produce 
circoits wldch are %e\Sef for placement Our proposed approach still 
separates logic synthesis from physical design. 

1 Introduction 

Conventiona] logic synthesis assumes that the delay of a circuit de- 
pends only on the delays of the gates in the circuit and mostly ig- 
mm the effect of interconnect delay. However, as we move to- 
wards smaller geometries, interconnect delay is becoming an inoeas- 
mgly laiger fraction of the total delay. In fact. Semiconductor Indus- 
trial Alliance's National Technology Roadmap for Semiconductor for 
1997 [ 1] predicts that interconnect delay will start dominating the total 
gate delay as we move down to O.lSp technology and below. Another 
study by Keutzer, et al (5] shows that for 0l25/i technology and below, 
interconnect delay can contribute anywhoe fiom 50% to 80% of the 
total delay. Therefore, logic synthe^ can no longer afford to ignore 
the effect of interconnect delay during opdmization. 

In this paper, we adopt a diametrically opposite approach to that of 
conventional logic synthesis. We perfonn lo^ synthe^ to optimize 
only for interconnect delay, ignoring the effect of gate delays. Our 
appmsch is based on the simple observation that if an output o depends 
on an input /, then the best way to connect ^ to o is through a path which 
is monotonic &om itooy that is, there are no diversions in the path 
fiom f to o. We fiist show, by means of an example, that conventional 
logic synthesis can produce a circuit for whidi it is impossible to find a 
placement with no diversions m the input-output paths. Therefore, no 
matter how good a place & route tool is, it may not be able to produce 
a ciicuit which is opdmal in terms of interconnect delay. 

We define the notion of Ulesal nodes. Intuitively spealditg, a node 
is illegal if it introduces a diversion in the circuit no matter where it is 
placed. We characterize the condition under which a node b illegal and 
a procedure to convert an axbitraiy drcoit mto a circuit which has 
only legal nodes. We call sudi a circuit a /fi;gal dicuiL Weshowthat 
for a legal circuit, there always exists a point placement of the nodes 
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such that every input-output path is monotonic We also provide a set 
of logic synthesis transfomnations which are guaranteed to preserve 
die legality of a circuit 

The proposed approadi has the advantage tiiat it still maintains a 
distinction between die logic syntiiesis and place & route stages. It 
does not need to tightly couple syntiiesis and phtcement by frequently 
altematmg between die two which can be inefficient and may not con- 
veigeatall. 

2 Previous Work 

So fiirveiy littie woxk has been done to model the effect of intercon- 
nect delay at die logic level Tliis is mainly due to the fact that at die 
logic level, veiy littie information is avail^le about the interconnect 
Most of tiiese approaches [9, 8, 14] use a rougji companion placement 
to estimate the cost of various logic synthesis oper^ons and make de- 
cisions based on this cost. In [13] an iterative approach to combine 
synthesis and placement is presented. Instead of using a companion 
placement to guide synthesis, they use actual placement which can be 
modified Incrementally based on the netlist changes. In [15] a heoris- 
tic to mimmize the layout cost is proposed which doesn't employ a 
companion placement solution. The method in [15] is based on mini- 
mizing the average fanout range and evenly distributing fanouts In the 
chip. It was shown that the diip delay could be reduced by this ap- 
proach if all the input pins are located on one side of the cfiip and all 
the ou^mt jnns on the opposite. Like [15], our approach also does not 
enqiloy a compaiuon placement We analyze conditions under «^idi 
a netiist is not "good^* for placement given the locations of i/o fans and 
tiy to tiansfbnn it into one which is. 

3 Pkdiminaries 

Definitionl A logic circuit L is a S-tupte (1,0,^. I is a set of 
primary inpta pins or simply primary iiqjuts, 0 is a set of primary 
omput pins or simply primary outputs, Eadi element cf 1 and Olsa 
biliary variable. An eletnent fj ^ f is a Junction fjiBi'^i^K Each 
fj is called the global function of the primary output o 

A tope circuit is represented by a Boolean networic [3]. If fij^ is an 
immediate fisnout of in the Boolean networic, we write i^. 
A lo^ circttit ispin-an/gned if each piimaiy iiq)ut / is labeled witii 
a position (j(|»yi) and each primary output o witii position ixo^y^* 
A logic dicuit L is placed if every node n of tiie Boolean netwoik 
representing £ has a position, i.e. every node n is labeled with (jQi»yn). 
and the resulting placement is denoted by 5?^. A point placement of L 
is a placement of L where each node is represented as a point Gwen a 
point-placed circait, a patii, p(/^o)> & primary input / to a primary 
output o is a sequence of coimected nodes fiom f to o, and die lengtii 
of the path, d^^y is the lengtii of all die wires along tiie padi from / 
to o. The path is called monotonic if its leogtii is equal to the 
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Manhattan distance firom r to 0. The plaoement^^ of X is optimal if 
there is no other placement of L whose length of the longest path is 
shorter than that of IPr.. 

Hie coonfinate s^ystem that we use in the p^ter assumes that ihe 
jf-oxis goes from left to right and the ^^-axis goes fiom top to bottom. 

4 Problem Descripfion 

CSven a bgjc ciicuit A the goal is to find a placed dicuit 9ix, such 
that the interconnect delay of the circnit is minimized. Duetoefifi- 
den^ reasons, we want to maintain the decouplh^ of the problem 
imo a separate synthesis phase followed by a place & route phase as 
in the conventional aj^ach. Given a lo^c drcuit L — (i, O, T)* we 
address the problem of finding a Boolean netwodc which when 
placed optimally, leads to a circuit with minimum mterconnect delay. 
It is up to the placement tool to find the optimal placemem for such 
a network. Intuidvely speaking, we are trying to create a circuit for 
i^tch a "good" placement exists. 

We assume that the die is represented by a rectaiigle K with width 
Mfft and height and the given togic drcdt is ptn-asstgned. We as- 
sume that the delay of a path is a Gnear funcdon of its length. In 
goieral, the interconnect delay depends qoadratically on the length of 
the interconnect However, it can be made linear by buffer insertion 
and wire sizing, as shown in recent studies by Otten and Bmyton [7] 
and Cong and Pan [4]. Acircuit is said to be optimal in terms of inter- 
connect delay if the length of a path from any primary input i to any 
primary output o is its Manhattan distance (monotonic)« i^ 



This defimtion is modvated by the pin-to-inn delay model of Kuki- 
moto and Br^n [6]. Under this moidel, a delay number is assigned 
for every inpnt-ou^ pdr: Tliis model is pai^ariy suited for in- 
teUectud property OP) bk>cks where the airiva] time of the pms are 
not known m advance. Consequently, any iiqmt-oiilpnt path can end 
upbemg acridcal path. Ht^ore, to minimize the delay, we have to 
m hti mi ze the del^ for all iiyut-outpat paths. We call this problem die 
IP-based synthesis problem. 

We will also be addressmg a slightly different problem called die 
sUxdc-based synthesis problenu where the only diffmnce from the IP- 
based problem is the objective function. Instead of minimizing the 
length of die path fixmi any primary iiqrat to any primary ou^t, we 
minimize the longest path of die circuit, i.e. 

mm{^ max 

In diis p;^ we will m ainly focus on the IP-based synthesis prob - 
lem. However, the approach can be modified to address the slack* 
based problem as weD. We will very briefiy discuss this in Secdon 7. 



5 Approach 

lb understand the problem better, let us first took at an example v^iere 
d» conventional logic syndiesis which considers only gates during op- 
dmizadon may not be able to find a circuit wiUi minimum interconnect 
delay. 



Hgure I: Network and its optimal placement 

SJ. Logic Syndesis and Interconiiect Dday: An Ex- 
ample 

Let us consider a minimum literal boolean network 9^ widi 10 lit- 
erals as shown in Rgure 1 on the left Assusung diat die jun positions 
am given, die optimal placement of !^ is shown in Figure 1 on die 
right Pins e and /are not shown and are assumed to be close to yi. 
In this sohttion, them are two longest padis of equal lengdi, t.e. one 
pathfrom^toyt and the odierftom 6 to yj.Hiis circuit is not optimal 
in terms of both die IP-based and die slack-based syndiesis problems 
because there is a better decomposition of die circuit that produces 
shorter longest padis. Hie better decomposed network widi 1 1 literals 
is shown m Figure 2 togedier widi its optimal placement Aldiough 
netwodc has fewer literals dian it has an extra padi from b 
to yi. Consequendy, the placement tool places node z to nunimize die 
longest paths from ^ toyi and>^. However, as we see m Hgure 2»>2 
is i nd epe n dent of b and dierefore, 6 can be removed from die support 
of xt- "Hils leads to die network in Figure 2 whose optimal placement 
has shorter longest path as compared to 
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Figure 2: Network and its optimal placement 

Aldmugh network ^ is better dian in terms of bodi IP-based 
and slack-based syndiesis problems, diere is yet a better decomposi- 
tion for die IP-based symheds problem. tn5\^, diepadiiromctoyi 
is greater dian its Manhattan cfistance. Hie same is true for the padi 
from </ to yt- A better deconq)osed circuit iV]^ widi 11 literals and its 
optimal placement are shown in Figure 3. 
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Hgure 3: Network 3^' and its optimal placement 

From die example above, we see diat sometimes die output of a 
logic syndiesb is not "goocT for placement, i^ no matter how we 
place the nodes, dtere is at least one padi which is longer than its Man- 
hattan distance. In our approach, the aim is to guide logic syndiesis 
such diat it produces a circu it which is go od for pi acement It is up to 
the placement tool to find die optimal placement for die decomposed 
chcuit hi the placement phase. 

In dus section we define what we mean by a cirouit which is "^good" 
for placement and dien give a set of transformation rules which can 
find such a cirouit Our approach'can be divided into two broad stages: 
constraint generation and constraint driven syndiesis. In die constraint 
generation step^ we partition die die into regions and identify die ^pes 
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of foncdons that are allowed to fin them. We define die notion of 
iBegd nodes. Intuitively speaking, a node is illegal if it can not be 
placed somewheie on die (fie without canshqg a divecdon hi the ciicuiL 
We show dial if a ctrcnit consists of only legal nodes tiien there exists 
a pomt placement of die nodes sodi that eveiy iiqnit*ompat padi b 
monotonlc. We caO such a circuit a %izf citcuit We characterize die 
cottdidoQ under which a node is illegal and give a pxoceduie to convert 
an arbitraiy cbcdt mto a legal circuit 

Since nodes have areas, in the constramt driven syndtesis step, we 
syndiesize die legal circuit to find anodier legal circuit vn!^ nuni- 
mum area. We extend die algebraic transformations and donH care 
mimmization such diat diey operate on legal nodes and produce legal 
nodes. As m the conventional logic syndiesis case, we use the nurnber 
of fiactoied*form literals as our area estimates smce it has been proven 
to be a good indication of die size of a Boolean network. 

5,2 Constraiiit Generatfon 

Since die length of eveiy path from a primary input to a primary output 
is restricted to its Manhattan distance (monotonic), there is a well de- 
fined region where a Boolean node can be placed. Let us define region 
formally. 

DeSn!tion2 A region r= {xt^yi^tyb}* ^^re xi <Xr mdyg <yi^ 
is the set ofaU points in the rectangle bounded at opposite comers by 
the points {xf^t) and (jf„>^). MathematlcaUy, r — {(x,y) \xi<x< 
Xrandyt<y<yhY 

Definftlon3 Given two points p\ = (jci»yi) andpi^ (xiijft). the 
region defined by pi and P2 is region r^^^^ = {min(xp,^pj), 
^(ypi typ»).niax(xp, ,Xpa),max(yp, ,yp^)}, 

Wih diese definitions, we ff> back to analyze why node z of die 
Boolean network 9{! in Figure 2 is "good" but not x. Because node z 
fansouttoyi and its support setts {a,^,c»d},zshould be placed in die 
regjon r^^ ^) » which is tn Figure 4, so that die padi finoro any primary 
input in the support set, i.e. a, 6, c, or d; to yi ismonotonic Fbrdie 
same example, there is no good region to place node x because there 
are two conflicting requironents. One reqtmement says diat node x 
should be placed in regumr(y,^^, which is r| in Figures, fordtepatit 
from c to yi to be monotonic; while the oth^ says that node x should 
be placed in legbn r(^^, which is rx in Figure 5, for die padi firom if 
to y2 to be monotonic. As shown in the figure,x can not be placed in 
bothri and II. Hence, X is not a desirable fector; 



b t e 4 

Figure 4: region of node z. 
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I^gure 5: Conflicting legal region requirements for x. 



S2J. Region Placement Constraints 

HiB exampte above OlnstratBS that if there is a padi from a primary 
irqrut I to a primary ompm 0, dien for the padt to be monotonic all die 
logic gates aloi^ die |»di shouM be placed in the region r^j^y This 
leads 05 to first partition die cfie hito rectangles along die idn positions 
and labd each re^on widi functions diatcan be placed in IL Continu- 
ing widi our example, die die area associated with yt >y2i and d 
is partitioned into regions {n ,72i r^if^i^rs) as shown in figure 6. 
R^nn is labeled widi{0,^,c,<f}y, to mean diatfiictors whose sup- 
ports are a subset of {a^b^e^d} and transitively fan out only to yi are 
allowed to be placed in Region rj is labeled widi {cyd}y^ and 
(a,^}^ to mean diat foctors whose supports are a subset of {c,^ and 
transitively fan out only to yi or factors whose supports are a subset 
of {a^b} and transitive^ £atts out only toyi are allowed to be placed 
in f^. Odier re^ns are labeled m a sinular fashion. Refeiiqg back to 
Boolean network we see diat node z is a "goocT node and can be 
placed in n because its support set is {a,6,c,d} and it transitively fans 
ontonly toyi. Tliis matches die label of r|. Node x is not a "good** 
node because there is no region whose label contsdns its support set 
[Cfd} and bodi of its transitive fenouts areyi andy2* 
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Figure 6: Regions and labels of regions. 



Defixdtion4 A placement constraint d is a 2-tuple (0^,0^), Y^here 
O^CO, and C /. is called the otaptaset and the support 
set ofd. We also write d as {/i,/2>"-}oi,02,». ^here = 
anda^ = {01,02,...}. 

Each region is labeled with a set of placement constraints, e.g. r\ is 
labeled widi {a,2r,c,<f}^ and o is labeled witii {c,d}^, and {a^b}y^ 
as diown in Kgure 6. A placonent constramt on a region r is called 
its region placement constndnt. 

Hence, each region placement constraim dr = (<y^ o^) in a region r 
denotes diat Boolean nodes that fan out only to a subset of die primary 
outputs in and have at most in dieir support can be placed in r. 

Node Placement Constraints 

We see that given a re^on r, only certain types of nodes can be placed 
in r and this is cqmired in its region placement constraint. We now 
define the dual for nodes. Given anode fi; it can only be placed in cer- 
tain regions. F6r example, node z of Boolean rwtwork 3^ in Hgure 2 
can only be pieced in region n as shown in Figure 6. Hence, we label 
each rKKle widi a placement constraint and it is called its node place- 
ment constndnt. The node placement constramt of node n denotes die 
support of n and its transhiveprunaryou^mts. For example, die node 
placement oonstrahit of z of Boolean network ts {a,^,c,d}^ . 

The node placement constraints of nodes of a Boolean netwoik can 
be eaaly computed by traveishig die Boolean network in a breaddi- 
first manner firom the primary mpnts to compute the support sets and 
from die primary outputs to compute the ouQmt sets. 
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523 Properties of Flaoement Constraints on Boolean Netvrorlcs 

In this section, we show what "good" nodes mean and having a 
Boolean Netwoik vndi only "good** nodes can lead to a monotonic 
pomt placement of the networic. 

Intuitively; a *^oo<r node is one that can be placed in a region. We 
define such *'good~ nodes as legal Hbwevei; befote wecan fbraially 
define the legality of a node, we need the definition of containment of 
placement construnts. 

DefiidtibnS Placement consnubu da = (0^,0^) is contained in 
piacemem constndnt dt^iCf'jd^), denoted as da Qdi^ ^O^CO^ 
and&'Qcf'. 

De&nition6 Boolean node n with node placement constraint 4i is 
legal with respect to region r with region placement constndnts 
denoted as n[,r^ Ifthereesdstsa J sttchthatd^Qdr^. 

Definition 6 says that node n is legal with respect to region r if n 
can be placed in r. 

Definition 7 A Boolean node n is legal if there is a region r such that 
Air. 

Definition 7 says that node n is legal if there is aregion r where n 
can be placed. This definition and Definition 6 aie about the legality 
of a Boolean node. Now ^ven a node, the next definition defin^ the 
region in which the node is legaL 

Definition 8 The legal regions of a node n, denoted as R[n), is the set 
of regions ^ = {n i /If ,a} such that for any region ry € « I r;. 

For clarity purposes, we denote the le^ region of a node n with 
node placement constraint 4, as R{d„). We will then asswne that given 
a node placement constrdnt, the node is implicitly defined. 

It can be easily seen that R({ik}oi) >s the region r^f^^^y If we define 
R(di)f\R{d2) to be (he overlapping cegbn between R{di) and R{<b)» 
then it is easy to see that ,/2, . - Moi^u-^n) equal to: 

R{{hU)nR({i2U)r\'''nR({i„}a,)n 
R{{iiU)nR{{i2}o2)(^-nR({imMn 

•••n 

i?({/iU)ni?({/2K)n-..nii({t„}oJ. 

This is called tlie intersection rule. For example, as shown in Figure 7, 
for node z of Boolean netwoik 

R{z) = R({a,b,c,d}y,) 

= m<^}n)^mb}y^)nR{{c}y,)nR({d}y,) 

= 

Based on Definition 7, the legality of a node n with node placement 
constramt da = {0*^<f) can be checked by traversing all regions and 
dieck if A is legal for each le^on. Assuming |/| > |0|, the complex- 
ity for thb algorithm is \0\) because the number of regions is 
0(|/| |0|) and the number of region placement oonstramts b a region 
is 0(|/|-f |0|). A better algoridun would be to check if the legal re- 
gion of n is empty or not. Tliis can be done by uang the intersection 
rule defined above. The complexly is then 0{\(y*\ |(f |), which can be 




Hgure 7: Re^n intersection for node z of 

much smaller However; there is a linear algorithm with complexity 
0{\C^\ 4* \<f\) accoidii^ to the next three lemmas. 

Lenima 1 bdow says that mxfes that transitively fan out to only one 
ou^ are always le^ 



il fbr a node placement constraint {ii|/2>.<.t4n}oi/»2.-^« 
wftft n= 1, il({/i,fe,...,/«}a,/»i^) 9fea 

Phioft For {/i,/2»"M'm}©it the pdnt (jio„yo,) is in 
/?({/i,/2...m'«}o,). ■ 

Lemma 2 below enumerates the cases when nodes that transitively 
fan out to two ou^ts are legal. 

Lemma 2 For a node placement constraint 0if/2>-..,4i}oi^p^« 
withm>2andn^Z R{{iuii,..^,im}ot/n,^) t^^W 

L {^Noxi>xo^yi>yo)'^^i^oxi>xohyi<yo)y^iioxi< 
xoAyi> yo) V ^tioxi <xo/\yi< ya), or 

2. Ri{iulz,'' ,im}ot)isapoint,Le.Xoi==XoiA\/lyi = C,oryoi^ 
yoi A>fix{ = C, for some C € 5\6 

Phmf^ Ifpart: 

1. Let us assume without bss of generality that ^Noxi > 
^ Ay/ > yo). and let Ixx^^ = (min{xi}tmin{yf}) and Otasx = 
(max{j;r},max{yo}), then the legal re^on is r^t^^^ and it 
is not empty. 

2. If the legal region is a point, then it is not empty. 

Onfyif part: Wihout loss of generality assume that the legal re^on 
isiK>tenq>tyanditisnotapoint.batJc^, <Xoi <JK^,i.e. o\ is on the 
top side of the die, then /I({/i,/2}tf|) ts a point if both /| and (2 are 
on die t0p side as well Q^gure 8a); it is a line otherwise (Hgure 8b). 
Since/J({/i,fe}o,rt) =i?({/i,l2}a,) Ai?({/|,t2}o,). it is not empty iff 
yii —yti andji^ =A«2* i-e. ot and 02 ue at o;^osite ade (Rgme 8c). 
If we have more than two hqnits, then they all have to be either on 
the top or the bottom dde of the chip for the legal re^on to be non- 
empty and the legal re^on has to be a point (Hgure 8<0- Hence, it is a 
contradiction. ■ 
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(b) (c) (<0 

Hgure 8: Figure for proof of LemmaZ 

Tlie following lemma says if a node tranatively fans out to more 
than two ou^mts, then there can only be one case where it is legal 
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Lemmas For a itode placement constndnt {^ufli— »tii}*i,flwii 
wiihm>Zwuln>% ie({i|,/2,-.., ^ 0 ff(iiioxi > 
Xo^yi>yo)^f (VM> J^>XoAyi<yo)y (ViVo < jq, a yj > y J V 

^mott The proof Is similar to the proof of Lemma 2. 

If part: Hits is the same as the first case of the if part of Lemma 2 
proof. 

Only if part Without loss of generality assume that the legal re^on 
is iK>t empty ImtXi^ <Xc^ <jcj^,i^ oi is on the tqi ^ of the die, 
dten Ri{iui2}ot is a point if both and /2 aie on the topside as well 
ffiguie 8a); it is a line othenwse (Rgmc 8b). Since /^({/i,/2}i>,^) = 
i?(ft,^2}oi)Ai?({/i,/2}o,),Uis not en^)iy Iffy/, =y^ and^;,, =x^. 
icoi and 02 are at opposite side (F%iire 8c). There is so way to add 
a third ou^ut to {iukhuoj widi anon-empty legal region. Hence, it 
is a contradiction. 9 

By the input-output symmetric nature of legal regions, the above - 
three lemmas qiply widi the role of m and n interchanged. 

Let the condition (ViVox/> j^, Ayi>yo) V(ViVojci >;«eAy^ <ytf) V 
Xi<x^Ayi>yo)V (Vft/o xi<XoAyi< y^) be called the nan- 
overlapping condition. Hien. with these three lemmas, die legally of 
anode wiUi node placement constraint {/|,/2,...,/«}<„^^^ can be 
checked with the following algorithm: 

L Ifn is l.dien die node is legal 

Z If the non-overlapping condition is true, then the node is legal 
Ihis can be checked In 0(m-(-n) by first fining die largest and 
smallest x and y ooordinates of bodi iiqnits and outputs and then 
chedc for die overlapping condition nsiiig these values. 

3. If die node placement constrdnt satisfies Condition 2 of 
Lemma 2, then it is legal. 

4. If none of the above axe satisfied, then die node is illegal. 

It is obvious diat this legality checking algoritiun is 0(m+n}. Hence, 
it is very efficient 

CoroOaiySJ There exists a comer point pc (f 
^(ft>^2>..-ifiii}ai^,...,tf,) thai Is dosest In distance to ail out- 
puts, and a comer point pfjunhestfiom all outputs. The point pe Is 
called the closest point of the region and pj the furthest paint 

Lemma4 /. flr R({lui2,...,l„U^^J n mU^^^l 
where (t ^ vi>'2>--)M* ^ ""f empty^ then It contains the 
dosest point qfR({luhy'>',lmU,pu~fi^). 

2. ///?{ft,l2,--M««}tf,^.»,tfJn^({/l,l2,.",4nW Ok ^ 

{oi ,i>2v " tOa), Is not empty, then It contains the furthest point 

Phwf: Assume diat m > 2 and n > 2. Hie proof is sinularfor other 
cases. 

I. Assume (VfVo x^>AbAy^>ya) (die proofe of die otfwr cases 
arBdiesamBXdienjgt>ai>A>3t>yo. Ifjtjt is greater dian die x- 
coordinalesofanyodierinpu|,dien/Z({i,,^i,,..,/«}^^^)n 

islessdian 

the x^coordinate of all odier inputs, then die vertical line goii^ 
diroug}i4paititionsi?({/i,/2,...,/«}o,^^J into two regions 
^^{{h,hy^'*tlm}ox/>xr^^a)^R({lk)cxpi,^^^^ is die partition 
diat ittdudes die closest point 



2. Theproofissnnihirtocasel. 



Lemma 4 says diat: 



1 . Adding inputs to a node placement constraint will not change the 
dosest pmnt of its legal re^n. 

2. Adding outputs to a node placement constraint will ru>t change 
the farthest point of its legal region. 

At diis point, we have defined what legal nodes are and how to 
check for legaHty of nodes. We now put die legal context into Boolean 
networks and discuss die implbation of legality of Boolean netwodc 
on placement 

Defiiiition9 A Boolean network Is Is every node In the network 
./J legal 

Hiere is a nice property of a legal Boolean network as described by 
the foUowing theorem. 

Theorem 5 J Given a legal boolean network, there exists a mono- 
tonic point placement for the network, 

Fkoof: 

This is an induction proof. We traverse die Boolean network in a 
revise topological order, Le. a node is visited only after aO its fanouts 
have beoi visited. 

Hie base case is where we have all primary outputs. Let o be an 
arbitrary primary output, then place at its pm location. For o, its phi 
location is its closest point The inductwn hypothecs is that fanouts of 
a node n axe placed at their dosest points and still mamtflfniiig mono- 
tonicity, i.e. die distances finom their dos^ pomts to dieir primary 
outputs are their Manhattan distances, we show that n can also be 
placed at its dosest point while still maintaining monotoniclty. 

Let n/ be an arbitrary fanout of n. Let c' be the node placement 
constraint of n/ widi all fanins except n removed. Also let die node 
placement constr^ts of n and nj hoc and c/. Then c/ is derived 
ftom by adding die primary inputs of &nins of n/ other than a and 
e is derived fiom t/ by adding the primary outputs of fenoms of n 
odierdiann/. We know diat /{(c') ^ 0 because C c and /{(c) 96 0 
by the assumptkm that n is lei^. By applying Lemma 4 for ^ch 
primary input added to to form c/, R(c/) mcludes die dosest point 
of Since R{c) C /2(cO. die distance from die closest point of 
i{(c) to a primary output 0 is die same as die sum of the distance from 
the closest point of R{c) to the closest point oTR{cf) and the distance 
from the dosest point of R(cf) and 0. Hrace, the monotonic property 
is maintained and n can be placed at die dosest point of A(c). 

■ 

Theorem 5.1 reduces our problem of findii^ a monotonic point 
placement of a circuit mto die problem of findhg a legal Boolean 
network. The logic syndiesis transformations we use to convert an 
illegal Boolean network into a legal one is called makeJegal^ and it is 
explmned below. 

S2A MakeJUeal 

The makeJegai operation takes a Boolean network as its input and 
produces a legal Boolean networic In die effort of produdng a legal 
Boolean network, it attempts to minimize die numbn* of new Boolean 
nodes created. 
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. Hie follourang temma and ooroUaiy guarantee that a Boolean net- 
work can ahvays be made legal 

Lemma 5 Jfn-^nj, and n Is illegal but n/ Is legale then collapsing n 
Into n/ will not make it/ Ulegd. 

^twdi Oollapsing n to a/ does not change the si^port of a nor does 
it add any primary ou^ut to th e transitive famnl of n /. Therefore, the 
node placement constraint of n/ does not change and hence n / stays 
legal ■ 
By the proof of Theorem 5.1, we know that every primary output is 
legal Then it is easy to see the follov^ oorollaxy. 

CoroUary 5.2 An Ulegd Boolean network can always be made legal 
by collapsing dl nodes Into the primary output nodes. 

Beside coH^^mg, node dtq>licadon can also legalize a node. 

Lemmatf If n^-^'n/, n-^ng, and n Is Ulegd but both n/ and Ug are 
legal, then duplicating n Into n\ nj and rti^rtg mdaes A| andnz 
legal 

Rnoofi The support of a is a subset of both the supports of a/ and 
A^, but the output set of the node placement constraint of a is a superset 
of the node placement constramts of both n/ and a^. By duplicating 
A into A| ~¥ nf and A2 -¥ a^, node placement constraint of A| is con* 

tained m that of A/ and thus Ai b legal Sunilarly for A2* ' 
MaikeJegal traverses the Boolean network in a reverse topological 
oider^ i^e. a node is visited dSta all its fanouts have been visited. Dur- 
ing the traversal if it sees an illegal node, it colitises the node into its 
fanouts until the tu>de becomes legal. Hence, there is a fomtier moving 
from each primary ou^ut to primary inputs in its support where every 
node is legal on the »de of the fiontier toward the prinuuy output If 
the sum-of-pxoduct expression of the fanout, as a result of collapsing a 
node mto one of its fanouts. exceeds a oso^iefined parameter, r, num- 
ber of Htetals, the node is r^licated for each fanout until it becomes 
legal The mtuition behind this parameter is that large nodes tend to 
have more common subfunctions with other nodes and thus altow for 
sharing. However, the parameter should not be too large since it can 
result in explosion in memory usage. 

As shown above, legality of a node can be checked efifidendy, that 
is, it IS linear m the size of the node placement oonstraim. Hence, the 
makeJegd operation is efficient 

53 Constraint-Driven Synthesis 

The constraint generation st^ takes a possibly illegal Boolean net- 
work and makes it legal Theorem 5.1 guarantees diat there exists 
a point phtoement for tlds network. However, by definition of the 
point placement of a circuit, nodes are assumed to be a point; hence, 
they have no area. In reality, nodes have area and the length of a 
kmgest path depends strongly on the size of a Boolean network. Tlie 
constrahit-driven synthesis step is responsible formiiumiztng the area 
of an already legal Boolean networic while preserving its legality. As 
mentioned in Section 5, we use the number of &ct<Hed-form literals of 
a Boolean network as a measure of the area of the circuit represented 
by the Boolean netwoik. So this step is to optimize the network such 
that we get a minimum literal legal Boolean network. 

We leverage the well developed algebraic transformations ui the 
conventional togic synthesis by extending them to deal with and pro- 
duce legal Boolean nodes. Each of these operations is explained be- 
low. 



FastExtract 

The/ortjexr/ncf algorithm is e^qildned in [16]. It basically looks for 
a two-cube divisor or a two-literal cube that reduces the most number 
of literals m every iteration. 

When dealing mUi legal Boolean itttwork, this algorithm may re^ 
suit hi illegal divisors. Fcx example, assume that node a is the best 
divisor found and it divides nodes jK;y, and z. Then die ou^iut set of 
the node placement constraint of A is the umon of the ott^ sets of the 
node placement constraints of jc, y. and From Section 5.2, we know 
tiiat the legal region of a may be emp^ and a may therefore be illegal. 
However, it may be the case that a remains legal if it only divides x 
and or jc and z, etc. Hence, the fastjextract algt^thm is modified 
such that the best legal divisor is chosen in every iteration. 

If node A divides a set of nodes iV, then complexity of finding a 
subset^/ of N which preserves the legality of a and has the largest re- 
duction in the number of literals is exponential in die size of N, Hence, 
aheuristic is used to select an optimal subset First the ruxles in ^ are 
ordered in decreasing sizes of the legal regions to form a listN^i,^. 
Then is linearly traversed. Each node is added to the subset Nt 
if the legality of n is preserved. Node a is used as a divisor If it reduces 
the number of literals in tlie netwoik. 

In this paper, the fasLextract implemented in SIS b used. 

53^ Resubstftntkm 

In the conventional logic synthesis, a node a is resubstituted into an- 
other node jc if A divides x This may affect the legality of both a and 
X, The followmg observation states when a and x can become illegal. 

Observation 1 Ifn divides x and both n andx are legd before resub- 
stitution, then efier resubstUutlon 

I, xcan become lUegd ff its support Is not the superset cfthat pfru 

Z n can become Ulegd if its output set Is not the superset cf that of 

In ihis paper, n can only be resubstituted into jc if die legaHty of a is 
preserved. Hence, a check is made before every resubstitntion. 

533 FtaHSimpfily 

There are two types of dont cares^ le. the observabOity don't cares 
(ODCs) and tiiesatisfiability don't cares (SDCs). Consulting Uie exact 
ODCs of a node is computationally expensive. In practice, a subset of 
die ODCs called the compatible ODCs (CODCs) are onnputed. Tliese 
CODCs are eiqpressed m terms of die primary inputs. Then togedier 
with the external don't cares (XDCs) of die primary outputs, a don*t 
care set in temis of the immediate fanins is computed udng an im- 
age conqnitation. In computing the SDCs, a snj^tort filter is used. A 
node is mduded in die SDCs if its support set mtersects die support 
set of die node being considered. Employmg SDCs m die minimiza- 
tiott procedure can result in boolean r^obstitutions. Tlie support filter 
procedure can also be used in die im^e computation of the CODCs 
and XDCs. Once die SDCs are computed and die XDCs and CODCs 
are expressed in terms of immediate fsmns, a two-level minimization 
algoridun is invoked to find an optimized expressbn. This is simply 
a biief description of fh^fldl^hnplify. For a more detail explanatton. 
we refer die readers to [10]. 
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Lemma? Thnitghom JUlLsimpUfy con^maOon, the ody steps that 
cm tntroduee iQegaUty Into the network are die Image compsaathn 
and the SDC computation. 

Pknoofi Let node n be the node we are conqmdng don*t caies foi; 
Legality of the Boolean netwoik can only change if an edge is acbled 
to the netwoik. Duiing the whole fulLsimplify process, only the lanm 
edges of n can be added. Edges of famns of other nodes can not 
change. Adding a famn e4ge to n means that a resubstitntion h^pens 
and Observation 1 appL'es. Potential new fanin edges of n aie added 
only during the image computation and SDC contputation through die 
soppoit filter, \(litch basically that a node x is a potential divisor 
ofnifthesnppoitofxmteisectdiesuppCfftofR. ■ 

We therefore constraua this operation by allowing a node x to be 
in the support filter when computing fuILsimpfify Ua node a if the 
inchision of node x preserves the legality of the networic according to 
Observation 1. 

S3 A Synthesis Flow 

With all the above basic operations, a ^thesis flow is then a script 
shnilar to the script.rugged m SIS. An empirical study needs to be 
conducted to derive an optunal script 

6 Experimental Results 

lb see the effect of the proposed qiproadu we have implemented the 
basic operations described in Secdon 53. An opthnization scr^ has 
been created and we call it scriptwire, which consists ot 

elifiiinate S 
sveep; elinisate -1 
sispXlfy -m nocomp 
eliminate -1 
sneep; elimiiiate 5 
siisplif y *a nocoiap 
resnb ~a 
fz 

resttb -a; sveep 
eliminate -1; sveep 
fuXl^sioplif y -ra nocoap 

Our experiment uses SIS and RiUial version 3.4. a dming-driven 
standard cell placer [12]. The input blif file and a randomly gener- 
ated pad assignment file is read into SIS. The script,wire optimization 
script is run m SIS to generate an optimized logic netlisL The op- 
tsnized netlist is tasppcd to the standard cell tedmology libraiy std- 
cetQ^getdib of SIS. The mapped netlist is then phioed by Ritual with 
a fixed pad assignment We measure the lengdi of the hmgest path and 
the delay of the Ritual ouqmt The distance of two cells is measured 
as the Manhattan distance finom the center of both cells. The length of 
apath is the sum of all distances between oonsecutivB cells along the 
p^ 

T^le 1 shows the results for four circotts. ThtckowX bbaruCombk 
obtained fiom the sequential circuit Mfora by removing all latches aiui 
treaiii^ the ouQmts of the latches as primary inputs and the iiqrats to 
the latdies as primaiy outputs of die network. Column 2, 3» and 4 show 
the number of literals in factored forms of the scripts scriptrugged. 
scriptdelay. and scriptwire respectively. Columns 5, 6. and 7 list dte 
length of die lortgest padi for each scrqit Colimms 8. 9* and 1 0 show 



the CPU time. The eqieriments were run on a DEC AlphaServer 8400 
with 20B of memory. The runtime is for the tecfanotogy mdependeni 
step. 

As shown m tins table, although the number of literals bicrfptu^/s 
approach is more than that of script.rugged; die ler^ of its longest 
p^ is die same for rd53 and better m other circuits. The longest padis 
are much shorter dian scriptMlay results. As seen ton diis table, the 
runtime is comparable. This is expected since the legality checking 
is linear in die dze of the node placement constrdnts and hence its 
runtime is ammor partof the total runtime. 

Ibble 2 shows the delay computed by Ritual for the four circuits. 
Columns 2, 3, and 4 show the cell delay foreach script The wim delay 
is shown in columns S» 6. and 7. The total delay is listed in columns 8, 
9. and 10. Excqrt for the total delay of z4ml runniiig scriptMlqy^ the 
total delay of all circuits b die best using script»wire, 

M I i t "TlWy 
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Figure 9: Number of literals vs number of nodes legalized for C13S5. 

7 Open Issues and Future Work 

Though the results in the previous sectbn shows that the approach per- 
forms satisfactorily, diese drcuits are fairly snail. For bigger circuits, 
the number of mvdes in a legal network can be large and optimizing 
such large networks using operations Hke fast^extract and fidLsSmpUJy 
can be very expensive. 

Tb Ohistrate diis, we plot die number of literals versus the number of 
nodes m the constiamt generation step for C1355 as shown m Figure 9. 
On die X-axis is die number of illegal nodes that are legalized. On die 
y-axis is die number of literals in die Boolean network. Thenetwork 
increases from 1032 literals to 23709 literals after216 nodes have been 
legalized out of a total of S 14 nodes in die networic 

There are three various directions diat can be pnrsued to adchess dus 
problem. The first one is to improve die area optimization algoridun 
presented m tins paper. Rewiring and redundani^ removal is a tech- 
nique Oiat foils mto this direction. SPFDs [2] can be used to minimize, 
rewire circuits, and potentially legalizmg nodes. In this paper, we are 
assuming diat we are ^ven a drcdt represented as a Boole^ network. 
We then app^ makeJegal and sever^ algehndc transformations fol- 
lowed by don't care minimization. The final circuit depends on die 
q;uality of die initial Boolean network. Alternatively, die Boolean net- 
work can first be collapsed as much as pos^le into a two4evel circuit 
where all primary outputs are expressed in terms primary inputs. Then 
fonctional decmnposition, like [1 1], can be ised to decompose die net- 
woric mto a mim'nmm literal legal Boolean network. 

The second direction which we believe is more promising is to relax 
the constraint that every padi must be monotonic. In odier words, dus 
is about solvmg die slack-based syndiesis problem instead of die more 
restrictive IP-based syndiesis problent This can be done by applymg 
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Tkble 1: P&th length con^arison of scr^,rugged^ scriptdeU^t and scnptwire for IP-based synthesis* 



Name 


Nmn 


ber of literals 


Length of Longest Path 


CPU Time 


sc^rugged 


scdelay 


sc.wirc 


scjqgged 




scwire 


sc jugged 


scjdd^ 


scwne 


24ml 


41 


84 


49 


1324 


1342 


1025 




03 


03 


rdS3 


42 


62 


50 


1122 


1624 


1122 


0.1 


03 


0^ 


idTB 


74 


178 


87 


1689 


2457 


1680 


0.8 


1.8 


\2 


bbantComb 


69 


79 


109 


2021 


1573 


1464 


03 


OJ 


03 



'Bible 2: Delay comparison of scriptnigged^ scr^tdelay, and scHpMire for IP-based synthesis. 

It WlreM^ |f~ 



Name 



Cell Delay 



Total Delay 



z4nil 


5.66 


5.28 


4.78 


093 


LQ3 


0.97 


6J9 


631 


5.75 


nlS3 


9.73 


737 


5.94 


1.67 


Z13 


142 


llw40 


9.50 


736 


id73 


7.01 


5.09 


5.59 


137 


088 


0.86 


838 


5.97 


6.45 


bbaraComb 


8.18 


602 


450 


il9 


1.72 


1.08 


1037 


7.94 


5S8 



the IP-based synthesis algorithm only to a subset of the paths. Intu- 
itively, we can wireplan only the critical padis so that no diverrions 
ate altowed in them; other paths can have diversions. One qiproach 
would be to modify the definition of legality so that legality is checked 
based on the primary mputs and outputs that are relevant only to the 
critical paths. Only the nodes on the critical paths are legalized. We 
have done some preliminary experiment and our results show that if 
you select lop few longest paths and legalize all the nodes on those 
paths* then the area penal^ is not very high. However, at present there 
is no easy way to perform a meaningful comparison of diis approadi 
modified IP-based algorithm to solve the slack-based synthesis 
jRoblon) with the conventional approach. For that, we need a plaoe- 
ment tool that uses the same delay model as ours and we have not been 
successful at makmg Ritual use our model 

One other issue that needs further attention is that of (nn assignment 
The qyproacfa in this paper assimies that the pin assignment is givoL 
In die design process, usually only partial pin assignment is givea. 
However, the quality of the final sohttion strongly depends on die inn 
locations. Therefore, we need to look into algorithns to find good 
pin assignment during synthesis. Such an algorithm can also be used 
to extend diis approach to handle sequential circuits by finding good 
placment for the latches present 

The opdmizatxons Ui at we have shown are technology mdependent 
We have not yet addressed die issue of technology mappmg. Also, 
we have completed ignored gate delays. We are presemly looking 
mto both of Uiese issues, ix. tedmology mapinng and how to best 
htcoiporate gate delays in our approach. 

Finally we are also boking into extemfing the proposed qyproadi 
to handle odier mteiconnect issues, like crosstalk and reliability. 

8 Condusions 

We have im}posed a new approach to deal widi die inoeasingly im- 
portance of wire delays m (teep submicroa technologies. It is based 
on die fact diat die shortest padi between any two pomts in a drcuit is 
die Manhattan distance between diem. We showed an example of why 
conventional logic ^yndiesis may produce drcuits where die munmum 
distance can not be achieved. 



The proposed approach sdll decouple logic syndiesis fdiase and 
place & route {toe. It consists of a constramt generadon step which 
produces a legal Boolean networic, which can be placed such that ev- 
ery padi is monotonic, and a consttamt-driven syndieas step which 
mimmizes die legal Boolean network while preserving legality. We 
show an example of how this approach can be extended to solve the 
slack-based synthesis problem. Finally, we describe directions f(ff fu- 
ture work which Includes an investigation into a new {dacement tool 
that works togedier with the proposed approach. 
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